AMERICAN 
JOURNAL OF MATHEMATICS 


FOUNDED BY THE JOHNS HOPKINS UNIVERSITY 


EDITED BY 
G. D. BIRKHOFF H. WEYL 
HARVARD UNIVERSITY THE INSTITUTE FOR ADVANCED STUDY 


F. D. MURNAGHAN H. WHITNEY 
THE JOHNS HOPKINS UNIVERSITY HARVARD UNIVERSITY 
A. WINTNER 
THE JOHNS HOPKINS UNIVERSITY 


WITH THE COOPERATION OF 


R. P. AGNEW V. G. GROVE R. BRAUER
C. CHEVALLEY M. H. HEINS J. DOUGLAS
G. A. HEDLUND D. C. LEWIS W. HUREWICZ
S. B. MYERS T. RADO N. LEVINSON
N. E. STEENROD H. S. WALL G. PALL


PUBLISHED UNDER THE JOINT AUSPICES OF 
THE JOHNS HOPKINS UNIVERSITY 
AND 
THE AMERICAN MATHEMATICAL SOCIETY 


VOLUME LXVI 
1944. 


THE JOHNS HOPKINS PRESS 
BALTIMORE 18, MARYLAND
U. S. A. 




Volume LXVI, Number 1 
JANUARY, 1944 


CONTENTS 


An extension of Galois theory to non-normal and non-separable fields. By N. Jacobson
Algebras derived by non-associative matrix multiplication. By A. A.
On the forms of the predicates in the theory of constructive ordinals.
The resultant of a linear set. By Ernst Snapper
r-regular convergence spaces. By Paul A. White
Universal functions of polygonal numbers, II. By L. W. Griffiths
The projective theory of surfaces in ruled space, II. By Chenkuo Pa
A generalization of associate quadrics of a surface. By Chenkuo Pa
Generalization of Waring's problem to algebraic number fields. By Carl Ludwig Siegel
An unsolved case of the Waring problem. By Ivan Niven
The Fan integrals interpreted as measures in a product-space. By A. J. Ward
The AMERICAN JOURNAL OF MATHEMATICS will appear four times yearly. 

The subscription price of the JOURNAL for the current volume is $7.50 (foreign
postage 50 cents); single numbers $2.00.

A few complete sets of the JOURNAL remain on sale.

Papers intended for publication in the JOURNAL may be sent to any of the Editors.

Editorial communications may be sent to Professor F. D. MURNAGHAN at The Johns
Hopkins University.

Subscriptions to the JOURNAL and all business communications should be sent to
THE JOHNS HOPKINS PRESS, BALTIMORE, MARYLAND, U. S. A.


Entered as second-class matter at the Baltimore, Maryland, Postoffice, acceptance for mailing at special 
rate of postage provided for in Section 1103, Act of October 8, 1917, Authorized on July 3, 1918. 


PRINTED IN THE UNITED STATES OF AMERICA 
BY J. H. FURST COMPANY, BALTIMORE, MARYLAND




AN EXTENSION OF GALOIS THEORY TO NON-NORMAL AND 
NON-SEPARABLE FIELDS.* ¹


By N. JACOBSON.


As is well known, modern Galois theory is a theory of the automorphisms
of an arbitrary field $P$. Its principal result is the establishment of a (1-1)
correspondence between the finite groups $g$ of automorphisms in $P$ and the
subfields $\Phi_g$ over which $P$ is finite, separable and normal, such that
1) $g_1 \supseteq g_2$ if and only if $\Phi_{g_1} \subseteq \Phi_{g_2}$ and
2) $g_2$ is invariant in $g_1$ if and only if $\Phi_{g_2}$ is normal over
$\Phi_{g_1}$. A second type of Galois theory has been given by the
author in a previous paper.² It associates the subfields $\Phi$ of a field $P$ of
characteristic $p$ over which $P$ has the form $\Phi(x_1, \cdots, x_m)$ where
$x_i^p = \xi_i$ in $\Phi$ with certain algebras of derivations in $P$. In seeking
an extension of these theories we have been led to a study of the
self-representations of $P$, i.e. the representations of $P$ by matrices with
elements in this field.³ One is led to seek an extension along these lines by the
following two remarks. First, if $S$ is an automorphism, then
$\alpha \to \alpha^S$ is a self-representation by 1-dimensional matrices and
second, if $D$ is a derivation, then

$$\alpha \to \begin{pmatrix} \alpha & \alpha D \\ 0 & \alpha \end{pmatrix}$$

is a representation by 2-rowed matrices.
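The second remark can be checked in one line. The following is a sketch, assuming only that the derivation $D$ is additive and satisfies the Leibniz rule $(\alpha\beta)D = (\alpha D)\beta + \alpha(\beta D)$, with operators written on the right as in the text:

```latex
\[
\begin{pmatrix} \alpha & \alpha D \\ 0 & \alpha \end{pmatrix}
\begin{pmatrix} \beta & \beta D \\ 0 & \beta \end{pmatrix}
=
\begin{pmatrix} \alpha\beta & \alpha(\beta D) + (\alpha D)\beta \\ 0 & \alpha\beta \end{pmatrix}
=
\begin{pmatrix} \alpha\beta & (\alpha\beta) D \\ 0 & \alpha\beta \end{pmatrix}.
\]
```

Additivity of the mapping is immediate, and since $1D = 0$ the element 1 is sent to the identity matrix, so the mapping is multiplicative exactly because of the Leibniz rule.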

Now by our definition of a self-representation any isomorphism of P into 
a subfield is a self-representation. It is therefore desirable to restrict the 

* Received July 12, 1943.

¹ Presented to the Society, September 11, 1943.

² "Abstract derivation and Lie algebras," Transactions of the American
Mathematical Society, vol. 43 (1937), p. 220.

³ A Galois theory of separable extensions which makes use of the concept of
self-representation has been announced recently by L. Kaloujnine, "Sur la théorie de
Galois des corps non galoisiens séparables," Comptes Rendus de l'Académie des Sciences,
vol. 214 (1942), pp. 597-599. (Abstracted in Mathematical Reviews, vol. 4 (1943),
p. 130). It appears that Kaloujnine's results are special cases of those obtained
here. See § 12. It should be mentioned also that Galois theory for separable extensions
based on the classical theory for separable normal extensions had been developed
previously. See Krasner, M. "Sur la théorie de la ramification des ideaux des corps
non-galoisiens de nombres algebriques," Thèse, Paris, 1938. The present theory (and
apparently that of Kaloujnine) is independent of the classical theory.




class of self-representations in such a way that any self-representation in this 
class having rank one (one-rowed) is an automorphism of P. This has been 
done by defining non-singular self-representations. There is a natural way 
to define the product of two self-representations: One substitutes for the 
elements of the matrices of one representation the matrices representing these 
elements in the second representation. This, of course, is a generalization 
of the product of isomorphisms. 

With any self-representation, or more exactly, with any class of similar
self-representations, we may associate a double $P$-module $\mathfrak{R}$ having the
property that its right dimensionality $(\mathfrak{R}:P_r) = m < \infty$. If, in
addition, $(\mathfrak{R}:P_l) = m$, then $\mathfrak{R}$ is said to be non-singular.
Modules of this type correspond to non-singular self-representations. One may also
define a product for double modules corresponding to the product of
self-representations.

Now with each self-representation we may associate a composite (ring)
of the field $P$ with itself. This consists of the ring of transformations in
the associated double module $\mathfrak{R}$ generated by the scalar multiplications
$x \to \alpha x$ and $x \to x\alpha$. These composites
may also be defined abstractly: A system $(K,S,T)$ is a composite of $P$ with
itself if $K$ is a ring and $S$ and $T$ are isomorphisms between $P$ and subfields
$P^S$ and $P^T$ of $K$ such that 1) $K = P^SP^T$, 2) $K$ is commutative,
3) $1^S = 1^T$, 4) $(K:P^T) < \infty$. If $(K',S',T')$ is a second composite and
the mapping $\sum\alpha^S\beta^T \to \sum\alpha^{S'}\beta^{T'}$ is a homomorphism,
then $(K,S,T)$ is a cover of $(K',S',T')$ ($(K,S,T) \ge (K',S',T')$). If this
mapping is an isomorphism, the two composites are equivalent. Throughout our
discussion equivalent composites are identified. One may define non-singular
composites and the product $(K,S,T) \times (L,U,V)$ of any two composites. A
composite $(K,S,T)$ is called a Galois composite if
$(K,S,T) \ge (K,S,T) \times (K,S,T)$, $(K,S,T) \ge$ the identity composite
$(P^U,U,U)$ and $(K,S,T) = (K,T,S)$. Our main result is the establishment of a
(1-1) correspondence between the Galois composites $\Gamma$ of $P$ and the
subfields $\Phi_\Gamma$ of $P$ over which $P$ is finite, such that
$\Gamma_1 \ge \Gamma_2$ if and only if $\Phi_{\Gamma_1} \subseteq \Phi_{\Gamma_2}$.

Any composite may be decomposed in a certain sense into indecomposable
composites $(K_i,S_i,T_i)$. Conditions that $(K,S,T)$ be a Galois composite
may be given in terms of the components $(K_i,S_i,T_i)$. If each $K_i$ is a field,
$(K,S,T)$ is called semi-simple. In this case the condition that $(K,S,T)$ be
a Galois composite is that the components $(K_i,S_i,T_i)$ form a hypergroup
relative to the product $(K_i,S_i,T_i)(K_j,S_j,T_j)$ defined to be the set of
components of the composite $(K_i,S_i,T_i) \times (K_j,S_j,T_j)$. Our fundamental
correspondence between composites and subfields induces a (1-1)
correspondence between semi-simple Galois composites and subfields $\Phi$ over which




$P$ is finite and separable. We investigate also the fields $\Phi$ over which $P$ is
normal. Combining our results for normal and separable fields we obtain a
(1-1) correspondence between the semi-simple composites whose hypergroups
are groups and the subfields $\Phi$ over which $P$ is finite, separable and
normal. Finally, it is easy to obtain the connection between these composites
and finite groups of automorphisms and this gives the classical theorem.
Our results could have been formulated in terms of self-representations
but this seems to be unnatural except in the case where $P$ has a primitive
element over $\Phi$. In a later paper we hope to investigate in greater detail the
theory for subfields $\Phi$ over which $P$ is purely inseparable.


1. Self-representations of fields. If $P$ is an arbitrary field, we define a
self-representation of $P$ as an isomorphism between $P$ and a field of $m$-rowed
matrices with elements in $P$ such that $1 \to 1$ the identity matrix. The integer
$m$ is the rank of the representation. If $E: \alpha \to \alpha^E$ in $P_m$ is a
self-representation, we define $E_{ij}$ as the transformation sending $\alpha$ into
the element in the $i$-th row and $j$-th column of $\alpha^E$.⁴ Thus
$\alpha^E = (\alpha E_{ij})$ and the $E_{ij}$ satisfy the following conditions:


(1) $(\alpha + \beta)E_{ij} = \alpha E_{ij} + \beta E_{ij}$, $(\alpha\beta)E_{ij} = \sum_k (\alpha E_{ik})(\beta E_{kj})$, $1E_{ij} = \delta_{ij}$.


Conversely any $m^2$ transformations $E_{ij}$ in $P$ that satisfy (1) determine a
self-representation $\alpha \to (\alpha E_{ij})$ of $P$.

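We shall call $\gamma$ a fixed element under $E$ if $\gamma^E$ is the diagonal matrix
$\{\gamma, \cdots, \gamma\}$, that is, if $\gamma E_{ii} = \gamma$ and
$\gamma E_{ij} = 0$ for $i \neq j$. Since sums, products and inverses of such
diagonal matrices are again of this form,
the set of fixed elements is a subfield of $P$.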

We shall recall now some of the important concepts from general
representation theory. Two self-representations $E$ and $F$ are similar if there
exists a fixed non-singular matrix $S = (\sigma_{ij})$ such that
$\alpha^F = S^{-1}\alpha^E S$ for all $\alpha$. Similar representations evidently
have the same fixed elements. A representation is reducible if it is similar to a
representation $F$ in which $F_{ij} = 0$ for $i > r > 0$ and $j \le r$. In this
case the correspondences $\alpha \to (\alpha F_{kl})$, $k, l = 1, \cdots, r$ and
$\alpha \to (\alpha F_{pq})$, $p, q = r+1, \cdots, m$ are self-representations. The
first of these is a subrepresentation and the second a difference representation
of $E$. If besides $F_{ij} = 0$ for $i > r$ and $j \le r$ we have $F_{ij} = 0$ for
$i \le r$ and $j > r$, then $E$ is decomposable into the components
$(\alpha F_{kl})$, $(\alpha F_{pq})$. In a similar manner decomposability into more
than two components may be defined. We shall write $E = E_1 + E_2 + \cdots + E_s$
when $E$ is decomposable into the components $E_i$. If $E = E_1 + \cdots + E_s$
where the $E_i$ are irreducible, $E$ is said to be completely reducible. If an
element $\gamma$ of $P$ is a fixed element relative to $E$, it is fixed under any
subrepresentation and under any difference representation of $E$.

⁴ $P_m$ denotes the ring of $m \times m$ matrices with elements in $P$.


2. Double P-modules. We shall now give a module formulation of the
representation problem. We define a double $P$-module⁵ as a commutative
group $\mathfrak{R}$ which is both a right $P$-module and a left $P$-module such that


1. $1x = x = x1$
2. $(\alpha x)\beta = \alpha(x\beta)$.


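Thus if we denote the endomorphism $x \to \alpha x$ by $\alpha_l$ and the
endomorphism $x \to x\alpha$ by $\alpha_r$, then $\alpha_l\beta_r = \beta_r\alpha_l$
for all $\alpha$, $\beta$ in $P$. We denote the field of endomorphisms $\alpha_l$
($\alpha_r$) by $P_l$ ($P_r$). For our purposes it suffices to consider
only double $P$-modules that satisfy the following condition

3. $(\mathfrak{R}:P_r) = m < \infty$.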


Hence from now on we use the term “ double P-module ” for “ double P-module 
satisfying condition 3.” 

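Let $x_1, \cdots, x_m$ be a basis for $\mathfrak{R}$ over $P_r$. Then
$\alpha x_i = \sum x_j\alpha_{ji}$ and it is readily verified that the
correspondence $E: \alpha \to (\alpha_{ij})$ is a self-representation of $P$. The
rank of $E$ is $m$. Conversely if $E$ is any self-representation, then we let
$\mathfrak{R}$ be a right $P$-module such that $(\mathfrak{R}:P_r) = m$. If
$x_1, \cdots, x_m$ is a basis for $\mathfrak{R}$ we write $x = \sum x_i\xi_i$ and
define $\alpha x = \sum x_j\eta_j$ where $\eta_j = \sum_i (\alpha E_{ji})\xi_i$.
Then it is readily verified that $\mathfrak{R}$ is a double $P$-module and
$\alpha x_i = \sum x_j(\alpha E_{ji})$. Thus any self-representation is obtained
from some double $P$-module in the manner indicated. If $y_i = \sum x_j\sigma_{ji}$
is a second basis over $P_r$ of the double $P$-module $\mathfrak{R}$ then
$\alpha y_i = \sum y_j(\alpha F_{ji})$ and $(\alpha F_{ij}) = S^{-1}(\alpha E_{ij})S$
where $S = (\sigma_{ij})$. Hence the different right bases for $\mathfrak{R}$
correspond to the different self-representations similar to $E$.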
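The similarity relation under a change of right basis can be verified directly. The sketch below uses only the module axiom $(\alpha x)\beta = \alpha(x\beta)$; the symbol $\tau$, for the entries of $S^{-1}$, is introduced here purely for illustration:

```latex
\begin{align*}
\alpha y_i &= \sum_j (\alpha x_j)\,\sigma_{ji}
            = \sum_{j,k} x_k\,(\alpha E_{kj})\,\sigma_{ji}
            = \sum_k x_k \bigl(\alpha^{E} S\bigr)_{ki}, \\
x_k &= \sum_l y_l\,\tau_{lk}, \qquad (\tau_{lk}) = S^{-1}, \\
\alpha y_i &= \sum_l y_l \bigl(S^{-1}\alpha^{E} S\bigr)_{li}.
\end{align*}
```

Comparing the last line with $\alpha y_i = \sum_l y_l(\alpha F_{li})$ gives $\alpha^F = S^{-1}\alpha^E S$, which is the definition of similarity used in section 1.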

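It is clear that an element $\gamma$ of $P$ is a fixed element under $E$ if and only
if $\gamma_l = \gamma_r$.

If $\mathfrak{S}$ is a submodule of $\mathfrak{R}$, $\alpha y$ and
$y\alpha \in \mathfrak{S}$ for any $y$ in $\mathfrak{S}$ and any $\alpha$ in $P$.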


⁵ Cf. E. Noether, "Hyperkomplexe Grössen und Darstellungstheorie," Mathematische
Zeitschrift, vol. 30 (1929), p. 669; v. d. Waerden, Moderne Algebra, vol. 2,
p. 131; or Jacobson, The Theory of Rings, New York, 1943, p. 95 (referred to
hereafter as R).


We may choose a basis $y_1, \cdots, y_m$ of $\mathfrak{R}$ over $P_r$ such that
$y_1, \cdots, y_r$ is a basis for $\mathfrak{S}$ over $P_r$. Then
$\alpha y_k = \sum y_l(\alpha F_{lk})$, $k, l = 1, \cdots, r$. Hence $E$ is reducible
and the representation $F$ determined by $\mathfrak{S}$ is a subrepresentation of
$E$. Also we have $\alpha y_p \equiv \sum y_q(\alpha F_{qp}) \pmod{\mathfrak{S}}$ for
$p, q = r+1, \cdots, m$. Hence the representation $G$ determined by the difference
$P$-module $\mathfrak{R} - \mathfrak{S}$ is a difference representation of $E$. In
a similar manner we see that the condition that $E$ be decomposable is that
$\mathfrak{R}$ be a direct sum of submodules $\neq 0$ and the condition that $E$ be
completely reducible is that $\mathfrak{R}$ be a completely reducible double
module.


3. Composites. If $\mathfrak{R}$ is a double $P$-module, the set of transformations
that are finite sums of products $\alpha_l\beta_r$ is a ring $P_lP_r = P_rP_l$. The
elements of $P_lP_r$ belong to $\mathfrak{L}$ the algebra of linear transformations
of $\mathfrak{R}$ over $P_r$. Now we recall that the scalar multiplications in the
algebra $\mathfrak{L}$ are the mappings $A \to A\alpha_r = \alpha_rA$ and that the
dimensionality of $\mathfrak{L}$ over $P_r$ is $m^2$. Since
$(P_lP_r)P_r = P_lP_r$, $P_lP_r$ is a subalgebra of $\mathfrak{L}$. Hence
$(P_lP_r:P_r) \le m^2$.

We now define a composite of a field $P$ with itself as a system $(K,S,T)$
consisting of a ring $K$ and two isomorphisms $S$ and $T$ between $P$ and subfields
$P^S$ and $P^T$ of $K$ such that the following conditions hold:


1. $K = P^SP^T$
2. $\alpha^S\beta^T = \beta^T\alpha^S$ for all $\alpha$, $\beta$
3. $1^S = 1^T$
4. $(K:P^T) < \infty$


Evidently by 1. and 2., $K$ is a commutative ring and by 1. and 3. $1^S = 1^T$ is
an identity element 1 in $K$. We have seen that any double $P$-module $\mathfrak{R}$
determines a composite $(P_lP_r, L, R)$ where $L$ denotes the isomorphism
$\alpha \to \alpha_l$ and $R$ denotes the isomorphism $\alpha \to \alpha_r$.

In all of our work we shall identify composites $(K,S,T)$ and $(K',S',T')$
that are equivalent in the sense that there exists an isomorphism $k \to k'$
between $K$ and $K'$ such that $(\alpha^S)' = \alpha^{S'}$,
$(\alpha^T)' = \alpha^{T'}$. The composite $(K,S,T)$ will be called a cover of
$(K',S',T')$ ($(K,S,T) \ge (K',S',T')$) if there exists a homomorphism $k \to k'$
between $K$ and $K'$ such that $(\alpha^S)' = \alpha^{S'}$,
$(\alpha^T)' = \alpha^{T'}$. Since $K = P^SP^T$, it is clear that if such a
homomorphism exists, it is unique. In fact, it may be characterized by the fact that
it maps $\sum\alpha^S\beta^T$ into $\sum\alpha^{S'}\beta^{T'}$. This, of course,
implies that the mapping $\sum\alpha^S\beta^T \to \sum\alpha^{S'}\beta^{T'}$ is
single-valued. Thus if $\sum\alpha^S\beta^T = 0$, then
$\sum\alpha^{S'}\beta^{T'} = 0$. Conversely if the latter condition holds, the
mapping $\sum\alpha^S\beta^T \to \sum\alpha^{S'}\beta^{T'}$ is a homomorphism and
hence $(K,S,T) \ge (K',S',T')$. The condition that $(K,S,T)$ and $(K',S',T')$ be
equivalent is that $\sum\alpha^S\beta^T = 0$ holds in $K$ if and only if
$\sum\alpha^{S'}\beta^{T'} = 0$ holds in $K'$. Thus $(K,S,T) = (K',S',T')$ if and
only if $(K,S,T) \ge (K',S',T')$ and $(K',S',T') \ge (K,S,T)$. Also, if
$(K,S,T) \ge (K',S',T')$ and $(K',S',T') \ge (K'',S'',T'')$,
then $(K,S,T) \ge (K'',S'',T'')$. By the fundamental theorem of homomorphisms
of rings any composite covered by $(K,S,T)$ is equivalent to a
composite $(K - \mathfrak{B}, \bar S, \bar T)$ where $\mathfrak{B}$ is an ideal in
$K$ and $\bar S$ and $\bar T$ are obtained from $S$ and $T$ by applying the natural
homomorphism that maps the element $k$ of $K$ into its coset
$\bar k = k + \mathfrak{B}$.

Any composite is equivalent to a composite determined by a double $P$-module.
For we may take $\mathfrak{R}$ to be $K$ and define
$\alpha x = \alpha^S x = x\alpha^S$ and $x\alpha = \alpha^T x$
for any $x$ in $\mathfrak{R}$ and any $\alpha$ in $P$. Then
$(\mathfrak{R}:P_r) = (K:P^T)$ is finite.
It is readily seen that the composite $(P_lP_r, L, R)$ of $\mathfrak{R}$ is
equivalent to $(K,S,T)$.

If $\mathfrak{S}$ is a submodule or a difference module of a double $P$-module
$\mathfrak{R}$, then the correspondence between $\sum\alpha_l\beta_r$ in
$\mathfrak{R}$ and $\sum\alpha_l\beta_r$ in $\mathfrak{S}$ is clearly a
homomorphism. Thus the composite $(P_lP_r, L, R; \mathfrak{R})$ associated with
$\mathfrak{R}$ is a cover of the composite $(P_lP_r, L, R; \mathfrak{S})$. If
$\mathfrak{R} = \mathfrak{R}_1 \oplus \mathfrak{R}_2$, then if
$\sum\alpha_l\beta_r = 0$ in both $\mathfrak{R}_1$ and $\mathfrak{R}_2$,
$\sum\alpha_l\beta_r = 0$ in $\mathfrak{R}$. It follows that
$(P_lP_r, L, R; \mathfrak{R})$ is a least common cover of the composites
$(P_lP_r, L, R; \mathfrak{R}_i)$, $i = 1, 2$, in the sense that any cover of the
latter two composites is a cover of $(P_lP_r, L, R; \mathfrak{R})$. We may also
define the least common cover $(K,S,T) + (L,U,V)$ for any two composites
$(K,S,T)$ and $(L,U,V)$ abstractly: For let $K \oplus L$ be the direct sum of the
rings $K$ and $L$. The elements of $K \oplus L$ are uniquely representable in the
form $k + l$, $k \in K$ and $l \in L$, and $(k + l)(k' + l') = kk' + ll'$. We set
$\alpha^X = \alpha^S + \alpha^U$ and $\alpha^Y = \alpha^T + \alpha^V$. Then the set
of elements $\alpha^X$ ($\alpha^Y$) is a field $P^X$ ($P^Y$) isomorphic to $P$. If
$k_1, \cdots, k_q$ is a basis for $K$ over $P^T$ and $l_1, \cdots, l_h$ is a basis
for $L$ over $P^V$, the elements $k_1, \cdots, k_q$; $l_1, \cdots, l_h$ form a basis
for $K \oplus L$ over $P^Y$. Thus $((K \oplus L):P^Y)$ is finite. Hence if
$M = P^XP^Y$, we may conclude also that $(M:P^Y)$ is finite. Thus $(M,X,Y)$
is a composite of $P$ with itself. Since
$\sum\alpha^X\beta^Y = \sum\alpha^S\beta^T + \sum\alpha^U\beta^V = 0$ if and only if
$\sum\alpha^S\beta^T = 0$ and $\sum\alpha^U\beta^V = 0$, $(M,X,Y)$
is a least common cover of $(K,S,T)$ and $(L,U,V)$. The least common cover
is uniquely determined in the sense of equivalence. In a similar fashion we
may define a least common cover for any finite number of composites.


We have seen that the fixed elements of a self-representation $E$ are the
elements $\gamma$ such that $\gamma_l = \gamma_r$ in the double $P$-module
associated with $E$. Evidently these elements are determined by the composite
$(P_lP_r, L, R)$. Accordingly we define an element $\gamma$ of $P$ to be a fixed
element of a composite $\Gamma = (K,S,T)$ if $\gamma^S = \gamma^T$. These elements
form a subfield of $P$.




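4. The relations space of a composite. Let $\Gamma = (K,S,T)$ be a composite
of $P$ with itself and suppose that $(K:P^T) = q$. Since $K = P^SP^T$, there
exists a basis for $K$ over $P^T$ consisting of elements
$\alpha_1^S, \cdots, \alpha_q^S$ in $P^S$. For any $\alpha$ in $P$ we may write

(2) $\alpha^S = \alpha_1^S\mu_1(\alpha)^T + \cdots + \alpha_q^S\mu_q(\alpha)^T$.

Since this expression is unique, the mapping $M_i: \alpha \to \mu_i(\alpha)$ is
single-valued. Since $(\alpha + \beta)^S = \alpha^S + \beta^S$,
$\mu_i(\alpha + \beta)^T = \mu_i(\alpha)^T + \mu_i(\beta)^T$ and hence

(3) $(\alpha + \beta)M_i = \alpha M_i + \beta M_i$.

If $\gamma$ is a fixed element under $\Gamma$,
$(\alpha\gamma)^S = \alpha^S\gamma^S = \alpha^S\gamma^T$. Hence

(4) $(\alpha\gamma)M_i = (\alpha M_i)\gamma$.

Now suppose that

(5) $\alpha_i^S\alpha_j^S = \sum_k \alpha_k^S\mu_{kij}^T$.

Then

$\alpha^S\beta^S = \sum_{i,j} \alpha_i^S\alpha_j^S\,\mu_i(\alpha)^T\mu_j(\beta)^T = \sum_k \alpha_k^S \sum_{i,j} \mu_{kij}^T\,\mu_i(\alpha)^T\mu_j(\beta)^T$.

On the other hand, $\alpha^S\beta^S = (\alpha\beta)^S = \sum \alpha_k^S\mu_k(\alpha\beta)^T$ so that

(6) $(\alpha\beta)M_k = \sum_{i,j} \mu_{kij}(\alpha M_i)(\beta M_j)$.

These equations may be written in a more useful form as follows: If $\beta$ is any
element of $P$, we let $\bar\beta$ denote the multiplication
$\alpha \to \alpha\beta$. With this notation equation (4) becomes

(4′) $\bar\gamma M_i = M_i\bar\gamma$

and (6) becomes

(6′) $\bar\beta M_k = \sum_i M_i\bar\rho_{ik}$

where $\rho_{ik} = \sum_j \mu_{kij}(\beta M_j)$. Equation (3) states that $M_i$ is
an endomorphism of the additive group of $P$. Of course, the multiplications
$\bar\beta$ are also endomorphisms. Hence this is true for any transformation of
the form $\sum M_i\bar\rho_i$, $\rho_i$ in $P$.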

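We suppose now that $\beta_1, \cdots, \beta_q$ is a second set of elements such that
$\beta_1^S, \cdots, \beta_q^S$ is a basis for $K$ over $P^T$. Then
$\beta_i^S = \sum_j \alpha_j^S\rho_{ji}^T$ where $(\rho_{ij}) = R$
is a non-singular matrix. If $R^{-1} = (\sigma_{ij})$ and

$\alpha^S = \beta_1^S\nu_1(\alpha)^T + \cdots + \beta_q^S\nu_q(\alpha)^T$

then $\mu_i(\alpha)^T = \sum_j \rho_{ij}^T\nu_j(\alpha)^T$ and
$\nu_i(\alpha)^T = \sum_j \sigma_{ij}^T\mu_j(\alpha)^T$. Thus

(7) $M_i = \sum_j N_j\bar\rho_{ij}$ and $N_i = \sum_j M_j\bar\sigma_{ij}$,

where $N_i$ denotes the mapping $\alpha \to \nu_i(\alpha)$.
Hence the totality $\mathfrak{A}$ of endomorphisms (of the additive group of $P$) of
the form $\sum M_i\bar\rho_i$ is independent of the choice of the basis. It is
clear also that any two equivalent composites determine the same sets
$\mathfrak{A}$. The set $\mathfrak{A}$ is evidently closed under multiplication on
the right by the multiplications $\bar\rho$. By (6′) $\mathfrak{A}$ is also closed
under multiplication by $\bar\rho$ on the left. We assert now that the $M_i$ are
right linearly independent relative to the $\bar\rho$. For by (2),
$\alpha_iM_j = \delta_{ij}$. Hence $\alpha_i\sum M_j\bar\rho_j = \rho_i$ and so if
$\sum M_j\bar\rho_j = 0$, $\rho_i = 0$ and $\bar\rho_i = 0$.
We shall call $\mathfrak{A}$ the relations space of the composite $(K,S,T)$ and we
shall denote the set of multiplications $\bar\rho$ by $\bar P$. We have therefore
proved the following

THEOREM 1. If $\mathfrak{A}$ is the relations space of the composite $(K,S,T)$ and
$\bar P$ is the set of multiplications, then
$\mathfrak{A}\bar P \subseteq \mathfrak{A}$ and
$\bar P\mathfrak{A} \subseteq \mathfrak{A}$. The right dimensionality
$(\mathfrak{A}:\bar P) = (K:P^T)$.⁶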
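The simplest instance of the theorem is the composite attached to a single automorphism. The sketch below is an illustrative special case, not an example taken from the text: it assumes $K = P$, $T$ the identity and $S$ an automorphism of $P$, with operators acting on the right and $\bar\rho$ denoting the multiplication $\alpha \to \alpha\rho$.

```latex
\begin{align*}
&K = P,\quad T = 1,\quad S \in \operatorname{Aut}(P):\qquad
  (K : P^{T}) = 1,\ \text{basis } \alpha_1^{S} = 1^{S} = 1,\\
&\alpha^{S} = \alpha_1^{S}\,\mu_1(\alpha)^{T}
  \ \Longrightarrow\ \mu_1(\alpha) = \alpha^{S},\qquad M_1 = S,\\
&\text{relations space} = \{\, S\bar\rho \;:\; \rho \in P \,\},\qquad
  \alpha(S\bar\rho) = \alpha^{S}\rho,\\
&\alpha(\bar\beta S) = (\alpha\beta)^{S} = \alpha^{S}\beta^{S}
  = \alpha\bigl(S\,\overline{\beta^{S}}\bigr)
  \ \Longrightarrow\ \bar\beta S = S\,\overline{\beta^{S}}.
\end{align*}
```

In this case the space is closed under multiplication by the multiplications on both sides and has right dimensionality $1 = (K:P^T)$, in agreement with the theorem. The fixed elements are the $\gamma$ with $\gamma^S = \gamma$.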


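⁶ Using (5) and (6′) one can show that the self-representation determined by the
basis $M_i$ of the double $P$-module $\mathfrak{A}$ is $E'$ the transposed
representation of the self-representation $E$ determined by the basis
$\alpha_1^S, \cdots, \alpha_q^S$ of $K$. See a forthcoming paper
of the author's entitled "Construction of central simple associative algebras."


5. Conditions for covering composites. If $\mathfrak{R}$ is a two-sided $P$-module
we have defined the composite of $\mathfrak{R}$ as $(P_lP_r, L, R)$ where $L$ is the
correspondence $\alpha \to \alpha_l$ and $R$ is the correspondence
$\alpha \to \alpha_r$. Now let $x_1, \cdots, x_m$ be a basis for $\mathfrak{R}$
over $P_r$ and set $\alpha x_i = \sum x_j(\alpha E_{ji})$. As we have seen the
correspondence $\alpha \to \alpha^E = (\alpha E_{ij})$ is a self-representation. By
the well-known isomorphism between linear transformations and matrices, we may
substitute for $\alpha_l$ the matrix $\alpha^E$ and for $\alpha_r$ the matrix
$\alpha^D = \{\alpha, \cdots, \alpha\}$ and obtain in this way a new composite
equivalent to $(P_lP_r, L, R)$. We denote this composite as $(P^EP^D, E, D)$ where
$E$ is the mapping $\alpha \to \alpha^E$ and $D$ is the mapping
$\alpha \to \alpha^D$.

Consider now the transformations $E_{ij}$. By (1) $E_{ij}$ is an endomorphism
of the additive group of $P$ and the set of $E_{ij}$ satisfy the following conditions:

(8) $\bar\gamma E_{ij} = E_{ij}\bar\gamma$

if $\gamma \in \Phi$ the field of fixed elements and

(9) $\bar\beta E_{ij} = \sum_k E_{ik}\,\overline{\beta E_{kj}}$.

As in the preceding section we determine the transformations $M_i$ by choosing
elements $\alpha_1, \cdots, \alpha_q$ such that $\alpha_1^E, \cdots, \alpha_q^E$
form a basis for $P^EP^D$ over $P^D$. Then
$\alpha^E = \alpha_1^E\mu_1(\alpha)^D + \cdots + \alpha_q^E\mu_q(\alpha)^D$ and
$\alpha E_{ij} = (\alpha_1E_{ij})(\alpha M_1) + \cdots + (\alpha_qE_{ij})(\alpha M_q)$.
Thus

(10) $E_{ij} = M_1\overline{\alpha_1E_{ij}} + \cdots + M_q\overline{\alpha_qE_{ij}}$

is in the relations space $\mathfrak{A}$ of our composite. It follows that the set
of endomorphisms of the form $\sum E_{ij}\bar\rho_{ij}$, where the $\bar\rho$ are
arbitrary multiplications, is a subspace $\mathfrak{B}$ of $\mathfrak{A}$ regarded
relative to $\bar P$ on the right. We assert that $\mathfrak{B} = \mathfrak{A}$.
For otherwise there exist $p < q$ endomorphisms $N_1, \cdots, N_p$ in
$\mathfrak{A}$ such that
$E_{ij} = N_1\bar\sigma_{ij1} + \cdots + N_p\bar\sigma_{ijp}$
or $\alpha E_{ij} = (\alpha N_1)\sigma_{ij1} + \cdots + (\alpha N_p)\sigma_{ijp}$.
Thus each matrix $\alpha^E$ is a linear combination of the $p$ matrices
$\sigma_k = (\sigma_{ijk})$, $k = 1, \cdots, p$, with coefficients $\alpha N_k$
and this contradicts the fact that $\alpha_1^E, \cdots, \alpha_q^E$ are linearly
independent over $P^D$. This proves

THEOREM 2. If $E_{ij}$ are the $E$'s determined by a self-representation $E$,
then the set of endomorphisms $\sum E_{ij}\bar\rho_{ij}$ is the complete relations
space $\mathfrak{A}$ of the composite associated with $E$.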


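Suppose now that $\Gamma = (K,S,T)$ is a cover of the composite
$\Gamma' = (K',S',T')$. We determine the $M_i$ for $K$ by equation (2). Then

$\alpha^{S'} = \sum_i \alpha_i^{S'}\mu_i(\alpha)^{T'}$

and every element of $K'$ is a linear combination of the $\alpha_i^{S'}$ with
coefficients in $P^{T'}$. Now let $\beta_1, \cdots, \beta_p$ be elements of $P$
such that the $\beta_k^{S'}$ form a basis over $P^{T'}$ of $K'$. Then we have
$\alpha_i^{S'} = \sum_k \beta_k^{S'}\rho_{ki}^{T'}$. Hence

$\alpha^{S'} = \sum_k \beta_k^{S'}\nu_k(\alpha)^{T'}$

where $\nu_k(\alpha) = \sum_i \rho_{ki}\mu_i(\alpha)$. The mappings
$N_k: \alpha \to \nu_k(\alpha)$ form a basis
for the relations space $\mathfrak{A}(\Gamma')$ of $\Gamma'$ and we have shown that
$N_k = \sum_i M_i\bar\rho_{ki}$.
Thus $\mathfrak{A}(\Gamma') \subseteq \mathfrak{A}(\Gamma)$.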

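Now suppose conversely that $\mathfrak{A}(\Gamma') \subseteq \mathfrak{A}(\Gamma)$.
Let $\mathfrak{R}$ and $\mathfrak{S}$ be double $P$-modules that have composites
equivalent to $(K,S,T)$ and $(K',S',T')$ respectively. Let $E$ be a
self-representation determined by $\mathfrak{R}$ and $F$ one determined by
$\mathfrak{S}$. Since $\mathfrak{A}(\Gamma') \subseteq \mathfrak{A}(\Gamma)$ it
follows from Theorem 2 that there exist elements $\rho_{ij,kl}$ such that
$F_{kl} = \sum E_{ij}\bar\rho_{ji,kl}$ for all $k$ and $l$. These
equations may be written in the form

(11) $\alpha F_{kl} = \operatorname{tr}(\alpha^E\rho_{kl})$

where $\rho_{kl}$ is the matrix $(\rho_{ij,kl})$. Now suppose that we have a
relation of the form $\sum\alpha^E\beta^D = 0$ where
$\beta^D = \{\beta, \cdots, \beta\}$. Then it follows readily from (11)
that $\sum\alpha^F\beta^D = 0$. Thus $(K,S,T)$ is a cover for $(K',S',T')$. This
completes the proof of the following important

THEOREM 3. A necessary and sufficient condition that the composite
$\Gamma \ge \Gamma'$ is that the relations space
$\mathfrak{A}(\Gamma) \supseteq \mathfrak{A}(\Gamma')$.

This implies

THEOREM 4. A necessary and sufficient condition that $\Gamma = \Gamma'$ is that
$\mathfrak{A}(\Gamma) = \mathfrak{A}(\Gamma')$.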


6. Non-singular composites. If $(K,S,T)$ is a composite such that
$(K:P^S) < \infty$, it defines a new composite $(K,T,S)$. It may happen that
both $(K:P^S)$ and $(K:P^T)$ are finite but that these dimensionalities are not
equal. For example let $K = P = \Phi(\lambda)$ the field of rational functions in
one indeterminate over $\Phi$. Let $T$ be the identity and $S$ the isomorphism
defined by $\lambda^S = \lambda^2$. Then the self-representation determined by $K$
is the isomorphism $\alpha(\lambda) \to \alpha(\lambda^2)$ that maps $P$ into a
proper subfield. We may avoid this type of situation by restricting our attention
to composites $(K,S,T)$ that are non-singular in the sense that
$(K:P^S) = (K:P^T)$. If $(K,S,T)$ is non-singular,
the composite $(K,T,S)$ will be called the inverse of $(K,S,T)$. If $(K,S,T)$
is equivalent to $(K,T,S)$ we shall call this composite symmetric.
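The dimension count behind the example of rational functions can be made explicit. The sketch below assumes the defining relation is $\lambda^S = \lambda^2$; the exponent is garbled in the source, so 2 is an assumption, and any exponent $n > 1$ leads to the same conclusion with $n$ in place of 2.

```latex
\begin{align*}
P^{T} &= \Phi(\lambda), & (K : P^{T}) &= 1,\\
P^{S} &= \Phi(\lambda^{2}), & (K : P^{S}) &= 2,
\end{align*}
\[
\alpha = a + \lambda\, b, \qquad a,\, b \in \Phi(\lambda^{2}) \ \text{uniquely determined by } \alpha \in \Phi(\lambda).
\]
```

Here $1, \lambda$ is a basis of $\Phi(\lambda)$ over $\Phi(\lambda^2)$ because $\lambda$ is a root of $x^2 - \lambda^2$ and does not lie in $\Phi(\lambda^2)$. The two dimensionalities are therefore 1 and 2, so this composite is singular.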

Corresponding to the concept of non-singular composite, we define a
double $P$-module $\mathfrak{R}$ to be non-singular if
$(\mathfrak{R}:P_l) = (\mathfrak{R}:P_r)$. Suppose that
$\mathfrak{R}$ has this property. We prove first the following

LEMMA. There exists a set of elements $y_1, \cdots, y_m$ that constitute both
a left and a right basis for $\mathfrak{R}$ over $P$.

The element $y_1$ may be taken to be any element $\neq 0$ in $\mathfrak{R}$. Then
$y_1\alpha \neq 0$ and $\alpha y_1 \neq 0$ for all $\alpha \neq 0$. Now suppose that
$y_1, \cdots, y_r$ are elements of $\mathfrak{R}$
which are left linearly independent and right linearly independent over $P$.
Then if $r < m = (\mathfrak{R}:P_l) = (\mathfrak{R}:P_r)$, there is an element $y$
not of the form $\sum\alpha_iy_i$. If $y$ does not have the form
$\sum y_i\alpha'_i$ either, $y$ may be taken to be the element $y_{r+1}$. Hence
suppose that $y = \sum y_i\alpha'_i$. Similarly we choose an element
$z$ not of the form $\sum y_i\beta_i$ and we may suppose that
$z = \sum\beta'_iy_i$. Now form $w = y + z$. Then if $w = \sum y_i\gamma_i$,
$z = w - y = \sum y_i(\gamma_i - \alpha'_i)$ contrary to
assumption. Similarly, $w$ cannot be represented in the form $\sum\beta_iy_i$.
Hence we may take $y_{r+1} = w$. This process leads to a basis of the required type.

The same method may be used to show that if $\mathfrak{S}$ is a non-singular
submodule of $\mathfrak{R}$, there exists a two-sided basis for $\mathfrak{R}$ that
includes a two-sided basis for $\mathfrak{S}$.

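Now if $\mathfrak{R}$ is a non-singular double $P$-module, we shall define the
inverse $\mathfrak{R}^{-1}$ of $\mathfrak{R}$ to be the module whose elements may be
put in (1-1) correspondence with the elements of $\mathfrak{R}$ in such a way that
if $x \to x'$ in $\mathfrak{R}^{-1}$,

(12) $(x + y)' = x' + y'$, $(\alpha x)' = x'\alpha$, $(x\alpha)' = \alpha x'$.

Thus $\mathfrak{R}^{-1}$ is obtained from $\mathfrak{R}$ by interchanging the roles
of $P_l$ and $P_r$. If $x_1, \cdots, x_m$ is a left (right) basis for
$\mathfrak{R}$, then $x'_1, \cdots, x'_m$ is a right (left)
basis for $\mathfrak{R}^{-1}$. Let $y_1, \cdots, y_m$ be a two-sided basis for
$\mathfrak{R}$ and $\alpha y_i = \sum y_j(\alpha E_{ji})$,
$y_i\alpha = \sum(\alpha E^*_{ji})y_j$. Then we recall that the correspondence
$\alpha \to \alpha^E = (\alpha E_{ij})$ is a self-representation determined by
$\mathfrak{R}$. In $\mathfrak{R}^{-1}$ we have
$\alpha y'_i = \sum y'_j(\alpha E^*_{ji})$ and
$y'_i\alpha = \sum(\alpha E_{ji})y'_j$. Thus $\alpha \to (\alpha E^*_{ij})$ is the
self-representation associated with $\mathfrak{R}^{-1}$. We note that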


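$\alpha y_i = \sum_j y_j(\alpha E_{ji}) = \sum_{j,k} \bigl((\alpha E_{ji})E^*_{kj}\bigr)y_k$, $y_i\alpha = \sum_j (\alpha E^*_{ji})y_j = \sum_{j,k} y_k\bigl((\alpha E^*_{ji})E_{kj}\bigr)$.

Hence $\delta_{ik}\alpha = \sum_j (\alpha E_{ji})E^*_{kj}$ and
$\delta_{ik}\alpha = \sum_j (\alpha E^*_{ji})E_{kj}$ or

(13) $\sum_j E_{ji}E^*_{kj} = \delta_{ik} = \sum_j E^*_{ji}E_{kj}$.

These equations may be written in a simpler form by introducing the ring
$\mathfrak{E}_m$ of $m$-rowed matrices with elements in the ring $\mathfrak{E}$ of
endomorphisms of the additive group of $P$. For we set $(E) = (E_{ij})$ and call
this matrix in $\mathfrak{E}_m$ the matrix of the self-representation. If
$(E^*) = (E^*_{ij})$ then (13) may be replaced by the single equation

(14) $(E)'(E^*)' = 1 = (E^*)'(E)'$

where the prime denotes the transposed matrix. The representation
$\alpha \to (\alpha E^*_{ij})$
will be called the inverse $E^{-1}$ of the representation $E$.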
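For a representation of rank one the inverse can be written down directly. The following sketch is an illustrative special case, not an example from the text: it assumes $\mathfrak{R} = yP$ is one-dimensional with $\alpha y = y\alpha^S$ for an automorphism $S$ of $P$.

```latex
\begin{align*}
\alpha y = y\,\alpha^{S} \quad &\Longrightarrow\quad E_{11} = S,\\
y\alpha = y\,\bigl(\alpha^{S^{-1}}\bigr)^{S} = \alpha^{S^{-1}}\, y \quad &\Longrightarrow\quad E^{*}_{11} = S^{-1},\\
E_{11}E^{*}_{11} = S S^{-1} = 1 &= S^{-1} S = E^{*}_{11}E_{11}.
\end{align*}
```

Here $y$ is a two-sided basis, the module is non-singular because $S$ maps $P$ onto itself, and the inverse representation is $\alpha \to \alpha^{S^{-1}}$, as the name suggests.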

If $(K,S,T)$ is a non-singular composite, we have seen that we may take
$K$ to be a double $P$-module $\mathfrak{R}$ by setting
$\alpha x = \alpha^S x = x\alpha^S$ and $x\alpha = \alpha^T x = x\alpha^T$.
The dimensionality $(\mathfrak{R}:P_l) = (K:P^S) = (K:P^T) = (\mathfrak{R}:P_r)$.
Hence $\mathfrak{R}$ is non-singular and its inverse is
$\mathfrak{R}^{-1} = (K,T,S)$ with $P_l$ here $= P^T$ and $P_r = P^S$.
If we choose two-sided bases in these modules we obtain a pair of inverse
self-representations of $P$.

If $(K,S,T)$ and $(L,U,V)$ are non-singular composites, their least
common cover $(M,X,Y)$ is also non-singular. For, let $\mathfrak{R}$ be a double
$P$-module such that $\mathfrak{R} = \mathfrak{R}_1 \oplus \mathfrak{R}_2$ where the
$\mathfrak{R}_i$ are non-singular submodules and $\mathfrak{R}_1$ has the composite
$(K,S,T)$ and $\mathfrak{R}_2$ has the composite $(L,U,V)$.
Then $(\mathfrak{R}:P_l) = (\mathfrak{R}_1:P_l) + (\mathfrak{R}_2:P_l) = (\mathfrak{R}_1:P_r) + (\mathfrak{R}_2:P_r) = (\mathfrak{R}:P_r)$
and so $\mathfrak{R}$ is non-singular. Thus $(M,X,Y)$ is non-singular. Its inverse
$(M,Y,X)$ is evidently the least common cover of $(K,T,S)$ and $(L,V,U)$. It
follows from this that if $(K,S,T)$ is an arbitrary non-singular composite, then
$(K,S,T) + (K,T,S)$ is a symmetric composite.


7. The product of self-representations. If $E$ and $F$ are two
self-representations of rank $m$ and $r$ respectively, they determine a
self-representation of rank $mr$ obtained by substituting for the elements
$\alpha E_{ij}$ of $\alpha^E$ the matrices $(\alpha E_{ij})^F$ that represent these
elements under $F$. We shall call this representation the product $E \times F$ of
the representations $E$ and $F$. The product $E \times F$ is an associative one.
Moreover if 1 denotes the identity self-representation $\alpha \to (\alpha)$ of
one row, then $E \times 1 = E = 1 \times E$ for all $E$.
Hence the set of self-representations forms a semi-group with an identity
relative to our composition. If $(E)$ is the matrix of $E$ and $(F)$ is the matrix
of $F$, then the matrix $(P)$ of $P = E \times F$ is the (right) direct product
$(E) \times (F)$, i.e. $P_{(i-1)r+k,\,(j-1)r+l} = E_{ij}F_{kl}$.
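A small case shows how the substitution works and why the order of the factors matters. The sketch below is an illustration under stated assumptions, not an example from the text: $E$ is the rank-one representation $\alpha \to (\alpha^S)$ given by an automorphism $S$, and $F$ is the 2-rowed representation given by a derivation $D$, with operators acting on the right.

```latex
\begin{align*}
\alpha^{E} = (\alpha^{S}), \qquad
\alpha^{F} &= \begin{pmatrix} \alpha & \alpha D \\ 0 & \alpha \end{pmatrix},\\[4pt]
\alpha^{E \times F} = (\alpha^{S})^{F}
  &= \begin{pmatrix} \alpha^{S} & (\alpha^{S}) D \\ 0 & \alpha^{S} \end{pmatrix},\\[4pt]
\alpha^{F \times E}
  &= \begin{pmatrix} \alpha^{S} & (\alpha D)^{S} \\ 0 & \alpha^{S} \end{pmatrix}.
\end{align*}
```

In $E \times F$ each entry of $\alpha^E$ is replaced by its image under $F$, and in $F \times E$ each entry of $\alpha^F$ is replaced by its image under $E$. The two matrices coincide for all $\alpha$ exactly when $SD = DS$ as endomorphisms of the additive group of $P$.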

We shall define next a product of double $P$-modules that corresponds to
the product of self-representations. Let $\mathfrak{R}$ and $\mathfrak{S}$ be two
double $P$-modules. We wish to construct a double $P$-module $\mathfrak{P}$ and a
function $xy$ of $x$ in $\mathfrak{R}$ and of $y$ in $\mathfrak{S}$ such that the
following conditions obtain:

1. $\alpha(xy) = (\alpha x)y$
2. $(xy)\alpha = x(y\alpha)$
3. $(x\alpha)y = x(\alpha y)$
4. $x(y + y') = xy + xy'$
5. $(x + x')y = xy + x'y$
6. Any element of $\mathfrak{P}$ has the form $\sum xy$ for suitable $x$ in $\mathfrak{R}$ and $y$ in $\mathfrak{S}$.
7. $(\mathfrak{P}:P_r) = (\mathfrak{R}:P_r)(\mathfrak{S}:P_r)$.

Let $x_1, \cdots, x_m$ be a right basis for $\mathfrak{R}$ and
$\alpha x_i = \sum x_j(\alpha E_{ji})$ and let $y_1, \cdots, y_r$
be a right basis for $\mathfrak{S}$ and $\alpha y_k = \sum y_l(\alpha F_{lk})$.
Then if $x = \sum x_i\xi_i$ and $y = \sum y_k\eta_k$
conditions 1.-5. imply that

(15) $xy = \sum x_iy_l(\xi_iF_{lk})\eta_k$.

By 6. every element of $\mathfrak{P}$ is a linear combination of the elements
$x_iy_k$. Hence by 7. these elements form a right basis for $\mathfrak{P}$ over
$P$. The procedure for defining $\mathfrak{P}$ and $xy$ is now clear. We let
$\mathfrak{P}$ be a right module of dimensionality $mr$ over $P$ and define $xy$ by
(15) where $x_iy_k = z_{(i-1)r+k}$ form a basis for $\mathfrak{P}$.
Then 2., 3., 4., 5. and 6. are valid. Next we define $\alpha_l$ to be the linear
transformation in $\mathfrak{P}$ over $P_r$ that sends $x_iy_k$ into
$\sum x_jy_l(\alpha E_{ji}F_{lk})$. We set $\alpha z = z\alpha_l$
and we observe that
$\alpha(xy) = \sum x_jy_l(\alpha E_{ji}F_{lp})(\xi_iF_{pk})\eta_k$. We may now
verify that 1. holds and hence $\mathfrak{P}$ is a double $P$-module and $xy$ is a
function satisfying our conditions.

If $x'_1, \cdots, x'_m$ and $y'_1, \cdots, y'_r$, respectively, form a second pair
of right bases for $\mathfrak{R}$ and for $\mathfrak{S}$,
$x = \sum x'_i\xi'_i$ and $y = \sum y'_k\eta'_k$. Hence
$xy = \sum x'_iy'_l(\xi'_iF'_{lk})\eta'_k$. Thus all the $xy$ are linear
combinations of the elements $x'_jy'_l$ and so these elements form a second right
basis for $\mathfrak{P}$. Now suppose that we construct a second space
$\mathfrak{P}'$ and a second function $xy$ which we shall denote as $[xy]$ by using
the bases $x'_i$ and $y'_k$ in place of the $x_i$ and the $y_k$. Then it is readily
verified that the mapping
$\sum x'_iy'_k\xi_{ik} \to \sum [x'_iy'_k]\xi_{ik}$
is an isomorphism between the double $P$-modules $\mathfrak{P}$ and
$\mathfrak{P}'$. In this sense the module $\mathfrak{P}$ does not depend on the
choice of the bases $x_i$ and $y_k$. We shall therefore denote this module as
$\mathfrak{R} \times \mathfrak{S}$ and shall call it the product of the
modules $\mathfrak{R}$ and $\mathfrak{S}$.

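By definition

$\alpha(x_iy_k) = \sum x_jy_l(\alpha E_{ji}F_{lk})$.

Hence the representation determined by the basis $z_1, \cdots, z_{mr}$,
$z_{(i-1)r+k} = x_iy_k$ is the product $E \times F$. The independence of
$\mathfrak{P} = \mathfrak{R} \times \mathfrak{S}$ of the bases of $\mathfrak{R}$
and $\mathfrak{S}$ may be stated as

THEOREM 5. If $E'$ and $F'$ are self-representations similar respectively
to $E$ and $F$, then $E' \times F'$ is similar to $E \times F$.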


If $x'_1, \cdots, x'_p$ are right linearly independent in $\mathfrak{R}$ and
$y'_1, \cdots, y'_q$ are right linearly independent in $\mathfrak{S}$ then the
elements $x'_jy'_l$ are right linearly independent in $\mathfrak{P}$. For we may
supplement the $x'$ and the $y'$ to obtain bases $x'_1, \cdots, x'_m$;
$y'_1, \cdots, y'_r$. Then we have seen that the elements $x'_iy'_k$ are
linearly independent. Now suppose that the elements $x'_1, \cdots, x'_p$ form a
right basis for a submodule $\mathfrak{R}'$ of $\mathfrak{R}$ and the elements
$y'_1, \cdots, y'_q$ form a right basis for a submodule $\mathfrak{S}'$ of
$\mathfrak{S}$. Then the elements $x'_jy'_l$ form a basis for
a submodule $\mathfrak{P}'$ of $\mathfrak{P}$ and $\mathfrak{P}'$ is essentially
the module $\mathfrak{R}' \times \mathfrak{S}'$. This proves

THEOREM 6. If $E'$ is a subrepresentation of $E$ and $F'$ is a subrepresentation
of $F$, then $E' \times F'$ is a subrepresentation of $E \times F$.

In a similar manner we may prove

THEOREM 7. If $E$ is decomposable into the components $E_i$ and $F$ is
decomposable into the components $F_j$, then $E \times F$ is decomposable into the
components $E_i \times F_j$.


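We suppose now that $\mathfrak{R}$ and $\mathfrak{S}$ are non-singular double
$P$-modules and that the $x_i$ and $y_k$ are two-sided bases. Let the elements
$x'_i$ and $y'_k$ form the corresponding two-sided bases for $\mathfrak{R}^{-1}$
and $\mathfrak{S}^{-1}$. If $\alpha x_i = \sum x_j(\alpha E_{ji})$,
$\alpha x'_i = \sum x'_j(\alpha E^*_{ji})$ where the self-representation $E^*$ thus
determined is the inverse of $E$. Similarly $\alpha y_k = \sum y_l(\alpha F_{lk})$
and $\alpha y'_k = \sum y'_l(\alpha F^*_{lk})$ where $F^*$
is the inverse of $F$. Now $x_i\alpha = \sum(\alpha E^*_{ji})x_j$ and
$y_k\alpha = \sum(\alpha F^*_{lk})y_l$ and so
$(x_iy_k)\alpha = \sum(\alpha F^*_{lk}E^*_{ji})x_jy_l$. Hence every element of
$\mathfrak{R} \times \mathfrak{S}$ is a left linear combination of the elements
$x_iy_k$. These form a basis. For if $\sum\rho_{ik}x_iy_k = 0$,
$\sum x_jy_l(\rho_{ik}E_{ji}F_{lk}) = 0$ and hence
$\sum_{i,k}\rho_{ik}E_{ji}F_{lk} = 0$ for all $j$ and $l$. It follows by (13) that
$\rho_{i'k'} = \sum_{j,l}\bigl(\sum_{i,k}\rho_{ik}E_{ji}F_{lk}\bigr)F^*_{k'l}E^*_{i'j} = 0$.
We have therefore proved that $\mathfrak{R} \times \mathfrak{S}$ is
non-singular and that the elements $x_iy_k$ form a two-sided basis for this module.
The elements $(x_iy_k)'$ form a two-sided basis for the inverse module and we
have $\alpha(x_iy_k)' = \sum(x_jy_l)'(\alpha F^*_{lk}E^*_{ji})$. Our argument shows
also that the elements $y'_kx'_i$ form a two-sided basis for
$\mathfrak{S}^{-1} \times \mathfrak{R}^{-1}$ and we may verify that
$\alpha(y'_kx'_i) = \sum y'_lx'_j(\alpha F^*_{lk}E^*_{ji})$. Hence the modules
$(\mathfrak{R} \times \mathfrak{S})^{-1}$ and
$\mathfrak{S}^{-1} \times \mathfrak{R}^{-1}$ are isomorphic.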


8. The product of composites. It is easy to see from Theorems 2 and 4
that the composite determined by $\mathfrak{R} \times \mathfrak{S}$ depends
only on the composite of $\mathfrak{R}$ and the composite of
$\mathfrak{S}$. For the composite of $\mathfrak{R} \times \mathfrak{S}$ has
as its relations space the smallest subspace over P (on the right)
containing all of the products MN where M is in the relations space of the
composite of $\mathfrak{R}$ and N is in the relations space of the
composite of $\mathfrak{S}$. Thus the composite of
$\mathfrak{R} \times \mathfrak{S}$ may be regarded as the product of the
composite of $\mathfrak{R}$ by the composite of $\mathfrak{S}$. It is of
interest to give a direct definition of the product of composites. We shall




obtain such a definition by introducing a type of three-fold composite of P 
with itself. Aside from its use in the present connection, the concept of a 
three-fold composite will play an important role in the proof in § 9 of one of
the main results of the present theory. 

We define first a three-fold composite of P with itself as a system
$(\mathfrak{A}, A, B, C)$ consisting of a ring $\mathfrak{A}$ and the
isomorphisms A, B, C of P into subfields $P^A$, $P^B$, $P^C$ of
$\mathfrak{A}$ such that

1. $\mathfrak{A} = P^A P^B P^C$,
2. $\mathfrak{A}$ is commutative,
3. $1^A = 1^B = 1^C$.

Evidently by 1. and 3., $1^A = 1^B = 1^C$ is the identity 1 of
$\mathfrak{A}$.

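Covers of three-fold composites are defined in the obvious way:
$(\mathfrak{A}, A, B, C)$ is a cover of $(\mathfrak{A}', A', B', C')$ if
there is a homomorphism $a \to a'$ between $\mathfrak{A}$ and
$\mathfrak{A}'$ such that $(\alpha^A)' = \alpha^{A'}$,
$(\alpha^B)' = \alpha^{B'}$, $(\alpha^C)' = \alpha^{C'}$. Equivalence is
defined in a similar manner.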

Now suppose that (K,S,T) and (L,U,V) are arbitrary (two-fold)
composites. We wish to construct a three-fold composite
$(\mathfrak{A}, A, B, C)$ having the following properties:

4. $(P^A P^B, A, B)$ is equivalent to (K,S,T) and $(P^B P^C, B, C)$ is
equivalent to (L,U,V).
5. $(\mathfrak{A} : P^C) = (K : P^T)(L : P^V)$.


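We shall show first that any two three-fold composites having these
properties are equivalent. For this purpose let $\alpha_1, \dots, \alpha_q$
be a set of elements of P such that $\alpha_1^S, \dots, \alpha_q^S$ form a
basis for K over $P^T$ and let $\beta_1, \dots, \beta_{q'}$ be elements
such that $\beta_1^U, \dots, \beta_{q'}^U$ form a basis for L over $P^V$.
As before, we determine the endomorphisms
$\alpha \to \mu_i(\alpha) = \alpha M_i$ and
$\alpha \to \nu_j(\alpha) = \alpha N_j$ by writing

$\alpha^S = \alpha_1^S \mu_1(\alpha)^T + \cdots + \alpha_q^S \mu_q(\alpha)^T$,
$\alpha^U = \beta_1^U \nu_1(\alpha)^V + \cdots + \beta_{q'}^U \nu_{q'}(\alpha)^V$.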
Suppose also that 
In A we have a4 = and Sates’, 
Bj? = Hence 


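Let $\mathfrak{B}$ denote the totality of elements of the form
$\sum \alpha_i^A \beta_j^B \xi_{ij}^C$. By (16), $\mathfrak{B}$ is a
subring of $\mathfrak{A}$. Since

(17) $\alpha^A = \sum \alpha_i^A \mu_i(\alpha)^B = \sum \alpha_i^A \beta_j^B \nu_j(\mu_i(\alpha))^C$,

$P^A \subseteq \mathfrak{B}$. Hence $1 \in \mathfrak{B}$ and
$P^C \subseteq \mathfrak{B}$. Moreover,
$\beta_j^B = (\alpha_i^A)^{-1}(\alpha_i^A \beta_j^B) \in \mathfrak{B}$ and
so each $\alpha^B = \sum \beta_j^B \nu_j(\alpha)^C$ is in $\mathfrak{B}$.
Thus $\mathfrak{B} = \mathfrak{A}$ and every element of $\mathfrak{A}$ is a
linear combination with coefficients in $P^C$ of the $qq'$ elements
$\alpha_i^A \beta_j^B$. Hence by 5. the elements $\alpha_i^A \beta_j^B$
form a basis for $\mathfrak{A}$ over $P^C$.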

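Now suppose that $(\mathfrak{A}', A', B', C')$ is a second three-fold
composite having the properties 4. and 5. Then the mapping
$\sum \alpha_i^A \beta_j^B \xi_{ij}^C \to \sum \alpha_i^{A'} \beta_j^{B'} \xi_{ij}^{C'}$
is clearly (1-1) and by (16), it is an isomorphism. It is evident from the
above considerations that this isomorphism maps $\alpha^A$ into
$\alpha^{A'}$, $\alpha^B$ into $\alpha^{B'}$ and $\alpha^C$ into
$\alpha^{C'}$.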

We now construct the composite $(\mathfrak{A}, A, B, C)$. Suppose that the
elements $\alpha_i^S$ and $\beta_j^U$ have been chosen in the way indicated
above. We shall suppose in addition that $\alpha_1^S = 1^S$ and
$\beta_1^U = 1^U$. Now let $P^C$ be a field isomorphic to P under the
isomorphism $\alpha \to \alpha^C$ and let $\mathfrak{A}$ be the algebra
over $P^C$ with the basis $\alpha_i^A \beta_j^B$ and the multiplication
table


(18) (6 = Bu? quar qv © 


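We shall show that $\mathfrak{A}$ is associative and that $\mathfrak{A}$
determines a composite satisfying our conditions. We note first that
$\alpha_1^A \beta_1^B$ acts as the identity 1 of $\mathfrak{A}$. The
elements $\beta_j^B = \alpha_1^A \beta_j^B$ satisfy the same multiplication
table as the $\beta_j^U$, with coefficients in $P^C$ in place of $P^V$, and
these elements are linearly independent over $P^C$. It follows that the set
of elements $\alpha^B = \sum \beta_j^B \nu_j(\alpha)^C$ is a subfield $P^B$
of $\mathfrak{A}$ isomorphic under the correspondence
$\alpha^B \to \alpha$ to P. The totality of elements
$\sum \beta_j^B \rho_j^C$ where the $\rho_j$ are arbitrary defines a
composite $(P^B P^C, B, C)$ equivalent to (L,U,V). We note next that if we
set $\alpha_i^A = \alpha_i^A \beta_1^B$ the product of this element with
$\beta_j^B$ (in either order) is the element $\alpha_i^A \beta_j^B$ of
$\mathfrak{A}$. The elements $\alpha_i^A$ satisfy the relations
$\alpha_i^A \alpha_j^A = \sum_k \alpha_k^A \varepsilon_{ijk}^B$, where the
$\varepsilon_{ijk}$ are the structure constants of K determined by
$\alpha_i^S \alpha_j^S = \sum_k \alpha_k^S \varepsilon_{ijk}^T$, and these
elements are linearly independent over $P^B P^C$. If we make use of
equations for the $\nu$ analogous to (6), we may prove that
$(\alpha_i^A \rho^B)(\alpha_j^A \sigma^B) = \sum_k \alpha_k^A \varepsilon_{ijk}^B \rho^B \sigma^B$.
It follows that the totality of elements of the form
$\sum \alpha_i^A \rho_i^B$ is a ring isomorphic to K. The set of elements
$\alpha^A = \sum \alpha_i^A \mu_i(\alpha)^B$ is a field $P^A$ isomorphic
under the correspondence $\alpha^A \to \alpha$ to P and the set of elements
$\sum \alpha_i^A \rho_i^B$ determines a composite $(P^A P^B, A, B)$
equivalent to (K,S,T). Now the elements of $\mathfrak{A}$ may be written in
one and only one way in the form $\sum \alpha_i^A l_i$ where
$l_i \in P^B P^C$. We may prove that
$(\alpha_i^A l)(\alpha_j^A m) = (\sum_k \alpha_k^A \varepsilon_{ijk}^B)\, l m$
and this implies that $\mathfrak{A}$ is associative. Since $\mathfrak{A}$
is generated by $P^A$, $P^B$ and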




$P^C = 1P^C$, $\mathfrak{A}$ is commutative. Conditions 4. and 5. are
evidently satisfied. Hence $(\mathfrak{A}, A, B, C)$ is the required
three-fold composite.

We now define the product of the composites (K,S,T) × (L,U,V) as the
composite $(P^A P^C, A, C)$ determined from the three-fold composite
$(\mathfrak{A}, A, B, C)$ that we have constructed. This product is
uniquely determined. It can be seen that if (K,S,T) is a cover of
(K',S',T') and (L,U,V) is a cover of (L',U',V'), then the three-fold
composite $(\mathfrak{A}, A, B, C)$ is a cover of the three-fold composite
$(\mathfrak{A}', A', B', C')$ constructed from (K',S',T') and (L',U',V').
It follows that (K,S,T) × (L,U,V) is a cover of (K',S',T') × (L',U',V').
If (K,S,T) and (L,U,V) are non-singular, the three-fold composite
$(\mathfrak{A}, C, B, A)$ has the same relation to (L,V,U) and (K,T,S) as
$(\mathfrak{A}, A, B, C)$ has to (K,S,T) and (L,U,V). Hence
(L,V,U) × (K,T,S) = $(P^A P^C, C, A)$ and the latter is the inverse of
$(P^A P^C, A, C)$. It is not difficult to give a direct proof of the
associative law for multiplication of composites. For this purpose it is
necessary to construct a four-fold composite (H, A, B, C, D) such that
$(P^A P^B P^C, A, B, C)$ is equivalent to $(\mathfrak{A}, A, B, C)$ and
$(P^B P^C P^D, B, C, D)$ is equivalent to the three-fold composite
determined by (L,U,V) and (M,X,Y). We shall not carry through this proof
but instead we shall obtain the associative law indirectly by determining
the relations space of the product of the composites (K,S,T) and (L,U,V).


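Let $\delta_1^A, \dots, \delta_r^A$ be a basis for $P^A P^C$ over $P^C$ and
suppose that

(19) $\alpha^A = \sum_k \delta_k^A \pi_k(\alpha)^C$.

Then the endomorphisms $\alpha \to \pi_k(\alpha) = \alpha P_k$ form a basis
over P of the relations space of the composite $(P^A P^C, A, C)$. Since
$\alpha^A = \sum \delta_k^A \pi_k(\alpha)^C = \sum \alpha_i^A \mu_i(\delta_k)^B \pi_k(\alpha)^C$
and since $\mu_i(\delta_k)^B = \sum \beta_j^B \nu_j(\mu_i(\delta_k))^C$,

$\alpha^A = \sum \alpha_i^A \beta_j^B \nu_j(\mu_i(\delta_k))^C \pi_k(\alpha)^C$.

Hence by (17) $\nu_j(\mu_i(\alpha)) = \sum_k \nu_j(\mu_i(\delta_k)) \pi_k(\alpha)$, or

$M_i N_j = \sum_k P_k \nu_j(\mu_i(\delta_k))$

is in the relations space of $(P^A P^C, A, C)$.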

Now let $R_1, \dots, R_s$ be a basis for the space over P (on the right)
generated by the products $M_i N_j$ and write $\alpha R_k = \rho_k(\alpha)$.
Then by expressing the elements $M_i N_j$ in terms of the $R_k$ we may
replace the relations (17) by

(20) $\alpha^A = \sum_{k=1}^{s} \zeta_k \rho_k(\alpha)^C$

where the $\zeta_k \in \mathfrak{A}$. We wish to show that the
$\zeta_k \in P^A P^C$. For this purpose we require the




LEMMA. If $R_1, \dots, R_s$ are endomorphisms in P that are linearly
independent over P, then there exist elements $\lambda_k$, $k = 1, \dots, s$
such that the matrix $(\lambda_k R_l)$ is non-singular.


Suppose that we have already determined r elements $\lambda_k$ such that
the r vectors $(\lambda_k R_1, \dots, \lambda_k R_s)$ are linearly
independent. Then we assert that if $r < s$, there exists an element
$\lambda_{r+1}$ such that the vectors
$(\lambda_1 R_l), \dots, (\lambda_{r+1} R_l)$ are linearly independent. For
otherwise for each $\lambda$ we have

(21) $\lambda R_l = \lambda_1 R_l\, \sigma_1(\lambda) + \cdots + \lambda_r R_l\, \sigma_r(\lambda)$, $l = 1, \dots, s$.

If $\sum_k \lambda_k R_l\, \sigma_k = 0$ for all l, by the linear
independence of the vectors $(\lambda_k R_l)$ for $k = 1, \dots, r$, each
$\sigma_k = 0$. It follows that the $\sigma$ in (21) are uniquely
determined and so the correspondence
$\lambda \to \sigma_k(\lambda) = \lambda S_k$ is single valued. It is clear
that $S_k$ is an endomorphism of P and the relation (21) states that each
$R_l$ is a linear combination with coefficients in P of the r
endomorphisms $S_k$. This contradicts the linear independence of the R's
and proves the lemma.
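
As a concrete check of the lemma, here is a small worked instance that is not taken from the text: the field $P = \mathbb{Q}(\sqrt{2})$, the endomorphisms $R_1$ (identity) and $R_2$ (conjugation), and the elements $\lambda_1 = 1$, $\lambda_2 = \sqrt{2}$ are all illustrative choices.

```latex
% Illustrative assumption: P = Q(sqrt 2), R_1 = identity, R_2 = conjugation,
% so that sqrt(2) R_2 = -sqrt(2). The elements lambda_1 = 1, lambda_2 = sqrt(2)
% are chosen by hand; the lemma only asserts that some such choice exists.
\[
  (\lambda_k R_l) =
  \begin{pmatrix}
    \lambda_1 R_1 & \lambda_1 R_2 \\
    \lambda_2 R_1 & \lambda_2 R_2
  \end{pmatrix}
  =
  \begin{pmatrix}
    1 & 1 \\
    \sqrt{2} & -\sqrt{2}
  \end{pmatrix},
  \qquad
  \det(\lambda_k R_l) = -2\sqrt{2} \neq 0 .
\]
```

The choice $\lambda_1 = \lambda_2 = 1$ would give two equal rows, which is why the second element has to be picked so that the new row is independent of the first, as in the inductive step of the proof.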

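Now choose the $\lambda$'s as in the lemma and write
$\lambda_l^A = \sum_k \zeta_k \rho_k(\lambda_l)^C$. Since
$(\rho_k(\lambda_l))$ is non-singular, there exist $\tau_{lm}$ in P such
that the matrix $(\tau_{lm})$ is the inverse of $(\rho_k(\lambda_l))$. Then
$\zeta_k = \sum_l \lambda_l^A \tau_{lk}^C$ is in $P^A P^C$. We now write
$\zeta_k = \sum_l \delta_l^A \beta_{lk}^C$ and substitute in (20). This
yields $\alpha^A = \sum \delta_l^A \beta_{lk}^C \rho_k(\alpha)^C$ and so by
(19) $\pi_l(\alpha) = \sum_k \beta_{lk} \rho_k(\alpha)$. Thus
$P_l = \sum_k R_k \beta_{lk}$ is in the space generated by the products
$M_i N_j$. This completes the proof of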


THEOREM 8. The relations space of a product of two composites is the
smallest space over P containing all products MN where M is in the
relations space of the first composite and N is in the relations space of
the second composite.


This along with Theorem 4 proves that multiplication of composites is
associative. Now it is readily seen that the relations space of the least
common cover (K,S,T) + (L,U,V) is the join of their respective spaces. It
follows from Theorem 8 that the distributive laws hold:
[(K,S,T) + (L,U,V)] × (M,X,Y) = (K,S,T) × (M,X,Y) + (L,U,V) × (M,X,Y) and
(M,X,Y) × [(K,S,T) + (L,U,V)] = (M,X,Y) × (K,S,T) + (M,X,Y) × (L,U,V).
We note also that any two composites $(P^U, U, U)$ are identical and
$(P^U, U, U)$ satisfies the equation
(K,S,T) × $(P^U, U, U)$ = (K,S,T) = $(P^U, U, U)$ × (K,S,T). For this
reason we shall call $(P^U, U, U)$ the identity composite.

If we refer to the preceding section we see that the composite of a product 
of two representations or modules is the product of the corresponding 


composites. 




9. Closed composites. We shall call a composite $\Gamma$ = (K,S,T) closed
(under multiplication) if (K,S,T) ≥ (K,S,T) × (K,S,T). By Theorem 3 and the
considerations of the preceding section, (K,S,T) is closed if and only if
its relations space $\mathfrak{M}(\Gamma)$ is a ring. A composite (K,S,T)
will be called a Galois composite if 1) (K,S,T) is closed, 2) (K,S,T) is a
cover of the identity composite $(P^U, U, U)$ and 3) (K,S,T) = (K,T,S). It
will be a consequence of the results of §§ 9 and 10 that any closed
composite is a Galois composite. Hence the conditions 2) and 3) can be
deleted in the definition of a Galois composite. We have preferred,
however, to state them as separate conditions since they correspond to the
existence of the identity and of inverses in a group.

We suppose now that $\Gamma$ = (K,S,T) is a closed composite and we let
$\Phi_\Gamma$ denote the set of fixed elements relative to $\Gamma$. Let
$\alpha_1, \dots, \alpha_q$ be elements of P such that
$\alpha_1^S, \dots, \alpha_q^S$ form a basis for K over $P^T$. We assert
that these elements form a basis for P over $\Phi_\Gamma$. To prove this we
consider the three-fold composite $(\mathfrak{A}, A, B, C)$ that satisfies
conditions 4. and 5. for (K,S,T) and (L,U,V) = (K,S,T). In (K,S,T) we have
$\alpha^S = \alpha_1^S \mu_1(\alpha)^T + \cdots + \alpha_q^S \mu_q(\alpha)^T$.
Hence in $\mathfrak{A}$ we have the relation
$\alpha^A = \sum \alpha_j^A \mu_j(\alpha)^B$. Since (K,S,T) is closed,
$(P^A P^C, A, C)$ is covered by (K,S,T). Hence
$\alpha^A = \sum \alpha_j^A \mu_j(\alpha)^C$. Thus
$\sum \alpha_j^A (\mu_j(\alpha)^B - \mu_j(\alpha)^C) = 0$. We have seen
that the elements $\alpha_j^A$ are independent over $P^B P^C$. Hence this
relation implies that $\mu_i(\alpha)^B = \mu_i(\alpha)^C$ and so each
$\mu_i(\alpha) \in \Phi_\Gamma$. Thus each $\alpha$ in P is a linear
combination with coefficients in $\Phi_\Gamma$ of the elements
$\alpha_1, \dots, \alpha_q$. Since the elements $\alpha_i^S$ are linearly
independent over $P^T$, the elements $\alpha_i$ are linearly independent
over $\Phi_\Gamma$.


THEOREM 9. Let $\Gamma$ = (K,S,T) be a closed composite, $(K : P^T) = q$,
and let $\Phi_\Gamma$ denote the field of fixed elements relative to
$\Gamma$. Then $(P : \Phi_\Gamma) = q$ and if $\alpha_1, \dots, \alpha_q$
are q elements of P, they form a basis for P over $\Phi_\Gamma$ if and only
if the elements $\alpha_1^S, \dots, \alpha_q^S$ form a basis for K over
$P^T$.


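The necessity of the condition is trivial. For if
$\alpha_1, \dots, \alpha_q$ form a basis for P over $\Phi_\Gamma$, each
$\alpha^S = \alpha_1^S \gamma_1^T + \cdots + \alpha_q^S \gamma_q^T$ where
the $\gamma_i^T \in \Phi_\Gamma^T = \Phi_\Gamma^S$. Hence the $\alpha_i^S$
are linearly independent over $P^T$.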

We consider now the relations ring $\mathfrak{M}(\Gamma)$. We have seen
that its dimensionality over P is q. Hence the dimensionality of
$\mathfrak{M}(\Gamma)$ over $\Phi_\Gamma$ is $q^2$. On the other hand we
have seen that the transformations belonging to $\mathfrak{M}(\Gamma)$
commute with the elements of $\Phi_\Gamma$, i.e., they are linear
transformations in the space P over $\Phi_\Gamma$. Since
$(P : \Phi_\Gamma) = q$, it follows that $\mathfrak{M}(\Gamma)$ is the
complete set of linear transformations of P over $\Phi_\Gamma$.




THEOREM 10. If $\Gamma$ = (K,S,T) is a closed composite and $\Phi_\Gamma$
is its field of fixed elements, the relations space $\mathfrak{M}(\Gamma)$
is the complete ring of linear transformations of P over $\Phi_\Gamma$.
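
The dimension count behind Theorem 10 can be made concrete in the smallest non-trivial case. The following is a sketch under stated assumptions rather than part of the original argument: it takes $P = \mathbb{Q}(\sqrt{2})$ over $\Phi = \mathbb{Q}$ (so $q = 2$), and it takes for granted the classical fact that the identity and the conjugation $\sigma$ span the relations space over P. The names $u$, $v$, $a$, $b$ are introduced only for this illustration.

```latex
% Assumptions (not from the source): P = Q(sqrt 2), Phi = Q, sigma = conjugation,
% transformations written on the right as in the text. A Phi-linear map L is
% fixed by the two images u = 1L and v = sqrt(2)L, both arbitrary elements of P.
\[
  \alpha L = \alpha\, a + (\alpha\sigma)\, b
  \quad\text{for all } \alpha \in P,
  \qquad
  a = \frac{u}{2} + \frac{\sqrt{2}}{4}\, v ,
  \quad
  b = \frac{u}{2} - \frac{\sqrt{2}}{4}\, v .
\]
% Check on the basis 1, sqrt(2) of P over Phi:
\[
  1L = a + b = u ,
  \qquad
  \sqrt{2}\,L = \sqrt{2}\,(a - b) = \sqrt{2}\cdot\frac{\sqrt{2}}{2}\, v = v .
\]
% Dimension count: two coefficients a, b in P give 2 * 2 = 4 = q^2 rational
% parameters, the dimension of the ring of 2 x 2 matrices over Q.
```

In this instance every $\Phi$-linear transformation is a right P-combination of the two transformations $1$ and $\sigma$, which is what the equality of dimensions $q^2$ in the proof predicts.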


Suppose now that $\Gamma'$ = (K',S',T') is any composite of P which leaves
the elements of $\Phi_\Gamma$ fixed. Then if N is any element of the
relations space $\mathfrak{M}(\Gamma')$, N is a linear transformation in P
over $\Phi_\Gamma$. Hence by Theorem 10, $N \in \mathfrak{M}(\Gamma)$, and
so $\Gamma' \leq \Gamma$. This implies the following


THEOREM 11. Let $\Gamma$ be a closed composite and $\Phi_\Gamma$ its field
of fixed elements. Then if $\Gamma'$ is any composite such that
$\Phi_{\Gamma'} \supseteq \Phi_\Gamma$, $\Gamma' \leq \Gamma$.


This, of course, implies that there is only one (in the sense of
equivalence) closed composite having $\Phi = \Phi_\Gamma$ as its field of
fixed elements.

It may be remarked that if $\Delta$ = (L,U,V) is any composite such that
$(P : \Phi_\Delta) = r < \infty$, then (L,U,V) is non-singular. For if we
set $\Phi = \Phi_\Delta$, then $(P^U : \Phi^U) = r = (P^V : \Phi^V)$. Hence
if $(L : P^V) = q'$, $(L : \Phi^V) = rq'$ and since $\Phi^U = \Phi^V$,
$(L : \Phi^U) = rq'$. It follows that $(L : P^U) = q'$. A similar argument
may be used to show that if $\mathfrak{S}$ is any double P-module with
composite equivalent to $\Delta$, then $\mathfrak{S}$ is non-singular. This
may be stated as follows:


THEOREM 12. If F is a self-representation of P such that the
dimensionality of P over the field of fixed elements is finite, then F is
similar to a non-singular self-representation.


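We again suppose that $\Delta$ = (L,U,V) is a composite such that
$(P : \Phi_\Delta) = r < \infty$. If $\Delta$ is not closed, we form the
least common cover $\Delta^{(1)} = \Delta + \Delta \times \Delta$. Then the
relations space $\mathfrak{M}(\Delta^{(1)}) > \mathfrak{M}(\Delta)$. It is
readily seen that the field of fixed elements
$\Phi_{\Delta^{(1)}} = \Phi_\Delta$ so that $(P : \Phi_{\Delta^{(1)}}) = r$
also. If $\Delta^{(1)}$ is not closed we set
$\Delta^{(2)} = \Delta + (\Delta \times \Delta) + (\Delta \times \Delta \times \Delta)$
and note that $\mathfrak{M}(\Delta^{(2)}) > \mathfrak{M}(\Delta^{(1)})$ and
$\Phi_{\Delta^{(2)}} = \Phi_\Delta$. Since $\mathfrak{M}(\Delta)$,
$\mathfrak{M}(\Delta^{(1)})$, $\mathfrak{M}(\Delta^{(2)})$, ... all have
dimensionalities over $\Phi_\Delta$ at most $r^2$, the dimensionality over
$\Phi_\Delta$ of the complete ring of linear transformations of P over
$\Phi_\Delta$, this process leads after a finite number of steps to a
closed composite
$\Gamma = \Delta + (\Delta \times \Delta) + \cdots + (\Delta \times \cdots \times \Delta)$.
By Theorem 10 we obtain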


THEOREM 13. Suppose that $\alpha \to (\alpha F_{kl})$ is a
self-representation of P such that $(P : \Phi) < \infty$ for $\Phi$ the
field of fixed elements. Then if L is a linear transformation of P over
$\Phi$, L is expressible as a polynomial in the transformations $F_{kl}$
with coefficients in P, the field of multiplications in P.
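
The closure process that leads to Theorem 13 can be followed by a dimension count in a small case. The sketch below is an illustration added here, not an example from the source: it assumes $P = \mathbb{Q}(\theta)$ with $\theta^3 = 2$, $\Phi = \mathbb{Q}$, and takes $\Delta$ to be the composite $(L, U, V)$ with $L = P^V[\lambda]/(\lambda^2 + \theta^V\lambda + (\theta^V)^2)$, where $\theta^U$ is the class of $\lambda$.

```latex
% Assumed setting (hypothetical example): theta^3 = 2, P = Q(theta), Phi = Q,
% Delta = (L, U, V) with (L : P^V) = 2 and theta^U a root of
% lambda^2 + theta lambda + theta^2 over P^V.
%
% Fixed field of Delta: for gamma = a + b theta + c theta^2 with a, b, c in Q,
\[
  \gamma^U - \gamma^V
  = (b - c\,\theta^V)\,\theta^U - \bigl(b\,\theta^V + 2c\,(\theta^V)^2\bigr),
\]
% which vanishes only for b = c = 0, so Phi_Delta = Q and r = (P : Phi_Delta) = 3.
%
% Dimensions over Phi_Delta = Q:
\[
  \dim \mathfrak{M}(\Delta) = 2 \cdot 3 = 6
  \;<\;
  \dim \mathfrak{M}(\Delta^{(1)})
  \;\le\; r^2 = 9 .
\]
% M(Delta^{(1)}) is a right P-space, so its dimension over Q is a multiple of 3;
% being larger than 6 and at most 9, it equals 9.
```

Under these assumptions $\Delta$ is not closed (Theorem 9 would otherwise force $(P : \Phi_\Delta) = 2$), and a single step $\Delta^{(1)} = \Delta + \Delta \times \Delta$ already reaches the complete ring of linear transformations of P over $\mathbb{Q}$.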




10. The fundamental Galois correspondence. We suppose now that P is an
arbitrary field and that $\Phi$ is a subfield of P such that
$(P : \Phi) = q < \infty$. We take two copies $P^S$ and $P^T$ of P and form
the direct product K of these fields relative to $\Phi$.⁷ Thus K is the set
of sums $\sum \alpha^S \beta^T$, $\alpha^S$ in $P^S$ and $\beta^T$ in
$P^T$, where $\gamma^S = \gamma^T$ if $\gamma \in \Phi$ and addition and
multiplication are defined in the obvious way. The dimensionality
$(K : \Phi^S) = q^2$ and $(K : P^T) = q$. We recall also that
$P^S \cap P^T = \Phi^S = \Phi^T$. It is evident that $\Gamma$ = (K,S,T),
where S is the mapping $\alpha \to \alpha^S$ and T is the mapping
$\alpha \to \alpha^T$, is a composite of P with itself. Since the mapping
$\alpha^S \to \alpha^T$, $\alpha^T \to \alpha^S$ is an automorphism in K,
(K,S,T) is symmetric. Since $P^S \cap P^T = \Phi^S = \Phi^T$, the field
$\Phi_\Gamma$ of fixed elements is $\Phi$. Now let (L,U,V) be any composite
of P leaving the elements of $\Phi$ fixed. Suppose that
$\sum \alpha^S \beta^T = 0$ in K. Then if $\alpha_1, \dots, \alpha_q$ form
a basis for P over $\Phi$, the elements $\alpha_i^S \alpha_j^T$ form a
basis for K over $\Phi^S = \Phi^T$ and the elements $\alpha_i^U \alpha_j^V$
are generators of L over $\Phi^U = \Phi^V$. We replace $\alpha^S$ by
$\sum \gamma_i^S \alpha_i^S$, $\gamma_i$ in $\Phi$, and $\beta^T$ by
$\sum \delta_j^T \alpha_j^T$, $\delta_j$ in $\Phi$, and substitute in the
relation $\sum \alpha^S \beta^T = 0$. Then the coefficients of the products
$\alpha_i^S \alpha_j^T$ in the resulting expression are all 0. Hence if we
substitute $\alpha^U = \sum \gamma_i^U \alpha_i^U$ and
$\beta^V = \sum \delta_j^V \alpha_j^V$ in $\sum \alpha^U \beta^V$, we see
that $\sum \alpha^U \beta^V = 0$. We have therefore proved that (K,S,T) is
a cover for (L,U,V). This shows, in particular, that
(K,S,T) ≥ (K,S,T) × (K,S,T) and (K,S,T) ≥ $(P^U, U, U)$.


THEOREM 14. If P is a field and $\Phi$ a subfield such that
$(P : \Phi) = q < \infty$, then the direct product of P over $\Phi$ with
itself determines a Galois composite whose field of fixed elements is
$\Phi$.


Now suppose that $\Gamma$ is any closed composite and let $\Phi_\Gamma$ be
the subfield of P of fixed elements under $\Gamma$. We have seen that
$(P : \Phi_\Gamma) < \infty$ and hence we may form the direct product
$\Gamma'$ of P over $\Phi_\Gamma$ by itself. By Theorem 11,
$\Gamma' \leq \Gamma$ and by what we have just proved
$\Gamma \leq \Gamma'$. Hence $\Gamma$ is equivalent to $\Gamma'$ and so we
have proved

THEOREM 15. Any closed composite is a Galois composite.


As a consequence of Theorems 9, 11, and 14 we have the following
fundamental

THEOREM 16. Let P be a fixed field and $\Gamma$ a Galois composite of P.
If $\Phi_\Gamma$ denotes the field of fixed elements under $\Gamma$, then P
is finite over $\Phi_\Gamma$ and the correspondence
$\Gamma \to \Phi_\Gamma$ is (1-1) between the Galois composites and the
subfields $\Phi$ over which P is finite. $\Gamma_1 \geq \Gamma_2$ if and
only if $\Phi_{\Gamma_1} \subseteq \Phi_{\Gamma_2}$.


7 For the definition and properties of the direct product see, for example, R, p. 88. 




11. Decomposition of composites. Let $\Gamma$ = (K,S,T) be an arbitrary
composite. Since K is a commutative algebra with a finite basis over
$P^T$, we may decompose K as a direct sum

(21) $K = K_1 \oplus \cdots \oplus K_s$

of indecomposable algebras $K_i$. The $K_i$ are uniquely determined.
Moreover if B is any ideal in K, $B = B_1 \oplus \cdots \oplus B_s$ where
$B_i = B \cap K_i$. Hence if N is the radical of K,
$N = N_1 \oplus \cdots \oplus N_s$ where $N_i = N \cap K_i$ is the radical
of $K_i$. We recall now that any indecomposable commutative algebra is
completely primary in the sense that the difference algebra with respect to
its radical is a field.⁸ Any ideal of such an algebra is contained in the
radical. It follows that the homomorphic image of such an algebra is also
completely primary and hence is also indecomposable. These results apply in
particular to the algebras $K_i$.
Corresponding to the decomposition (21) we write

$1 = e_1 + \cdots + e_s$

where $e_i^2 = e_i$, $e_i e_j = 0$ if $i \neq j$.

Then $K_i = Ke_i$. The mapping $k \to ke_i$ is a homomorphism $H_i$ of K
into $K_i$. We set $S_i = SH_i$, $T_i = TH_i$. These are isomorphisms
between P and subfields $P^{S_i}$ and $P^{T_i}$ of $K_i$. Since
$K = P^S P^T$, $K_i = P^{S_i} P^{T_i}$. Evidently
$(K : P^T) = \sum (K_i : P^T)$ and since $k_i \alpha^T = k_i \alpha^{T_i}$
if $k_i \in K_i$, $(K_i : P^{T_i}) = (K_i : P^T)$. Hence
$(K_i : P^{T_i})$ is finite. Thus $(K_i, S_i, T_i)$ is a composite of P. It
is evident that (K,S,T) is the least common cover of these composites. We
shall now show that the composites $(K_i, S_i, T_i)$ are disjoint in the
sense that there exists no composite (L,U,V) which is covered by both
$(K_i, S_i, T_i)$ and the least common cover of all the $(K_j, S_j, T_j)$
for $j \neq i$. For let (L,U,V) be such a composite. Suppose that
$e_i = \sum \alpha^S \beta^T$. Then
$e_i = e_i^2 = \sum \alpha^{S_i} \beta^{T_i}$ and
$0 = e_i e_j = \sum \alpha^{S_j} \beta^{T_j}$ for all $j \neq i$. In L we
have $1 = \sum \alpha^U \beta^V$ and $0 = \sum \alpha^U \beta^V$ and this
contradiction proves our assertion.

If B is an ideal in K and $k \to \bar{k}$ denotes the natural homomorphism
between K and $\bar{K} = K - B$, it is clear that
$(\bar{K}, \bar{S}, \bar{T})$ where $\alpha^{\bar{S}} = \overline{\alpha^S}$,
$\beta^{\bar{T}} = \overline{\beta^T}$ is a composite of P covered by
(K,S,T) and, as we have remarked, any composite covered by (K,S,T) can be
obtained in this way. If C is an ideal containing B, it is readily seen
that the composite $(K - B, \bar{S}, \bar{T})$ is a cover of
$(K - C, \bar{S}, \bar{T})$. Now since any ideal $B_i$ of $K_i$ is
contained in the radical


8 Cf. Theorem 3, Chapter 4 of R.




$N_i$, this shows that any composite covered by $(K_i, S_i, T_i)$ is a
cover of the composite $(K_i - N_i, \bar{S}_i, \bar{T}_i)$. Thus it is
impossible to find disjoint composites which are covered by
$(K_i, S_i, T_i)$.

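From the decomposition $K = K_1 \oplus \cdots \oplus K_s$ we obtain
$K - B = \bar{K} = \bar{K}_1 + \cdots + \bar{K}_s$. Also
$B = B_1 \oplus \cdots \oplus B_s$ where $B_i = B \cap K_i$. Hence if
$\bar{k}_1 + \cdots + \bar{k}_s = 0$, $k_1 + \cdots + k_s = b_1 + \cdots + b_s$
where the $b_i \in B_i$. Then $k_i = b_i$ is in $B_i$ and $\bar{k}_i = 0$.
Thus $\bar{K} = \bar{K}_1 \oplus \cdots \oplus \bar{K}_s$. Each $\bar{K}_i$
is indecomposable since it is the homomorphic image of the indecomposable
algebra $K_i$. We suppose that $\bar{K}_i \neq 0$ if $i \leq r$ and that
$\bar{K}_i = 0$ if $i > r$. We now denote the mapping $k \to \bar{k}$ by H,
the mapping $\bar{k} \to \bar{k}_i$ by $\bar{H}_i$ and we set
$\bar{S}_i = \bar{S}\bar{H}_i$, $\bar{T}_i = \bar{T}\bar{H}_i$. Then
$(\bar{K}_i, \bar{S}_i, \bar{T}_i)$ is a composite of P with itself.
Moreover, these composites are obtained from $(\bar{K}, \bar{S}, \bar{T})$
in the same way that the $(K_i, S_i, T_i)$ are obtained from (K,S,T). We
assert now that $(\bar{K}_i, \bar{S}_i, \bar{T}_i)$ is covered by
$(K_i, S_i, T_i)$. For suppose that $\sum \alpha^{S_i} \beta^{T_i} = 0$.
Then if $k = \sum \alpha^S \beta^T$, $k_i = kH_i = 0$. Now in the
decomposition $\bar{k} = \bar{k}_1 + \cdots + \bar{k}_s$, $\bar{k}_i$ is
the coset of $k_i$. Hence $\bar{k}_i = 0$. Thus
$\sum \alpha^{\bar{S}_i} \beta^{\bar{T}_i} = \bar{k}\bar{H}_i = 0$ and this
proves our assertion. In particular we see that if
$(\bar{K}, \bar{S}, \bar{T})$ is indecomposable, i.e. if $r = 1$, then
$(\bar{K}, \bar{S}, \bar{T})$ is covered by one of the composites
$(K_i, S_i, T_i)$.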

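Now suppose that (K,S,T) is the least common cover of the composites
$(K'_j, S'_j, T'_j)$, $j = 1, \dots, s'$, which are disjoint and
indecomposable in the sense that they are not least common covers of
disjoint composites. Then each algebra $K'_j$ is indecomposable and since
(K,S,T) is a cover of $(K'_j, S'_j, T'_j)$ one of the $(K_i, S_i, T_i)$ is
a cover of $(K'_j, S'_j, T'_j)$. Since the $(K_i, S_i, T_i)$ are disjoint,
only one of these, say $(K_{j'}, S_{j'}, T_{j'})$, is a cover for
$(K'_j, S'_j, T'_j)$. Since no two composites covered by
$(K_{j'}, S_{j'}, T_{j'})$ are disjoint, it follows that if $j \neq k$,
then $j' \neq k'$. If we re-arrange the $(K_i, S_i, T_i)$, we may suppose
that $j' = j$. We wish to show that $(K_j, S_j, T_j)$ and
$(K'_j, S'_j, T'_j)$ are equivalent and so we assume that this is not the
case. Then there is a relation $\sum \lambda^{S'_j} \mu^{T'_j} = 0$ such
that $\sum \lambda^{S_j} \mu^{T_j} \neq 0$. If
$e_j = \sum \alpha^S \beta^T$, then
$e_j = \sum \alpha^{S_j} \beta^{T_j}$ and
$\sum (\lambda\alpha)^{S_j} (\mu\beta)^{T_j} = \sum \lambda^{S_j} \mu^{T_j} \neq 0$.
Since $\sum \alpha^{S_k} \beta^{T_k} = 0$ if $j \neq k$,
$\sum \alpha^{S'_k} \beta^{T'_k} = 0$ and hence
$\sum (\lambda\alpha)^{S'_k} (\mu\beta)^{T'_k} = 0$. Thus we may suppose at
the start that $\sum \lambda^{S'_k} \mu^{T'_k} = 0$ for $k \neq j$. It
follows that $\sum \lambda^{S'_k} \mu^{T'_k} = 0$ for every k and hence
that $\sum \lambda^S \mu^T = 0$ and this contradicts the fact that
$\sum \lambda^{S_j} \mu^{T_j} \neq 0$. Now it is clear that $s' = s$. For
otherwise there exists a $(K_i, S_i, T_i)$ which is covered by the least
common cover of the $(K_j, S_j, T_j)$ for $j \neq i$. We have therefore
proved the following: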


THEOREM 17. Any composite (K,S,T) is expressible in one and only one way
as the least common cover of disjoint indecomposable composites.
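
The decomposition of Theorem 17 can be written out by hand for the direct product of § 10 in the simplest case. The following is an illustrative sketch, not taken from the source: it assumes $P = \mathbb{Q}(\sqrt{2})$, $\Phi = \mathbb{Q}$, and writes $s = (\sqrt{2})^S$, $t = (\sqrt{2})^T$ for the two copies of $\sqrt{2}$ in K (these abbreviations are introduced here only for the computation).

```latex
% Assumed setting: K = P^S P^T is the direct product of P = Q(sqrt 2) with itself
% over Q, with s = (sqrt 2)^S, t = (sqrt 2)^T, s^2 = t^2 = 2, hence (st)^2 = 4.
\[
  e_1 = \tfrac12\Bigl(1 + \tfrac12\, s t\Bigr),
  \qquad
  e_2 = \tfrac12\Bigl(1 - \tfrac12\, s t\Bigr),
  \qquad
  e_1 + e_2 = 1 .
\]
% Idempotent and orthogonal:
\[
  e_1^2 = \tfrac14\Bigl(1 + s t + \tfrac14 (st)^2\Bigr) = \tfrac14 (2 + st) = e_1 ,
  \qquad
  e_1 e_2 = \tfrac14\Bigl(1 - \tfrac14 (st)^2\Bigr) = 0 .
\]
% The two components K_1 = K e_1 and K_2 = K e_2:
\[
  s e_1 = \tfrac12 (s + t) = t e_1 ,
  \qquad
  s e_2 = \tfrac12 (s - t) = - t e_2 .
\]
```

So in $K_1$ the two copies of $\sqrt{2}$ coincide, while in $K_2$ they differ by a sign; each component has dimension 1 over $P^T$, and the dimensions add up to $q = 2$.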




We shall call the indecomposable composites $(K_i, S_i, T_i)$ the
indecomposable components of (K,S,T). The argument given above proves


THEOREM 18. If (L,U,V) is covered by (K,S,T) then each indecomposable
component of (L,U,V) is covered by an indecomposable component of (K,S,T).


Suppose now that (K,S,T) is a Galois composite. Then we have seen that any
composite covered by (K,S,T) is non-singular. This applies in particular to
the indecomposable components $(K_i, S_i, T_i)$. Consider now the inverses
$(K_i, T_i, S_i)$. It is clear that these are indecomposable and disjoint
and that their least common cover is (K,T,S). Since (K,S,T) = (K,T,S) it
follows that the set of inverses $(K_i, T_i, S_i)$ coincides with the set
$(K_i, S_i, T_i)$. We observe next that since
(K,S,T) × (K,S,T) ≤ (K,S,T) and since (K,S,T) × (K,S,T) is a cover of
every product $(K_i, S_i, T_i) \times (K_j, S_j, T_j)$, the indecomposable
components of these products are covered by suitable ones of the
$(K_k, S_k, T_k)$. We note finally that since (K,S,T) is a cover of
$(P^U, U, U)$ one of the $(K_i, S_i, T_i)$ is a cover of this composite.
Conversely suppose that (K,S,T) is any composite whose indecomposable
components $(K_i, S_i, T_i)$ satisfy

1. Each $(K_i, S_i, T_i)$ is non-singular,

2. One of the $(K_i, S_i, T_i)$, say $(K_1, S_1, T_1)$, is a cover of the
identity composite $(P^U, U, U)$,

3. For each $(K_i, S_i, T_i)$ the inverse $(K_i, T_i, S_i)$ is an
indecomposable component,

4. The indecomposable components of
$(K_i, S_i, T_i) \times (K_j, S_j, T_j)$ are covered by the
$(K_k, S_k, T_k)$.

Then since (K,S,T) × (K,S,T) is the least common cover of the products
$(K_i, S_i, T_i) \times (K_j, S_j, T_j)$, (K,S,T) is a cover of
(K,S,T) × (K,S,T). It is evident also that (K,S,T) ≥ $(P^U, U, U)$ and
that (K,S,T) is symmetric.

THEOREM 19. Conditions 1.-4. on the indecomposable components of a 
composite are necessary and sufficient that the composite be a Galois composite. 


12. Separable fields. We shall call a composite (K,S,T) simple if K is a
field. In this case the only composite covered by (K,S,T) is (K,S,T)
itself. (K,S,T) is semi-simple if its indecomposable components are simple.
If the components of (K,S,T) are $(K_i, S_i, T_i)$, $i = 1, \dots, s$, it
follows that any composite covered by (K,S,T) is the least common cover of
certain of the $(K_i, S_i, T_i)$.

We shall prove first the following


THEOREM 20. Let P be a field, $\Phi$ a subfield such that
$(P : \Phi) = q < \infty$ and let $\Gamma$ = (K,S,T) be the Galois
composite whose field of fixed elements $\Phi_\Gamma = \Phi$. Then a
necessary and sufficient condition that P be separable over $\Phi$ is that
$\Gamma$ be semi-simple.


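Sufficiency. Let $\alpha$ be any element of P and set
$\Sigma = \Phi(\alpha^p)$ where p is the characteristic of $\Phi$. Consider
the Galois composite $\Delta$ = (L,U,V) of P having
$\Sigma = \Phi_\Delta$ as its field of fixed elements. Since $\Delta$ is
covered by $\Gamma$, it is semi-simple. Now
$(\alpha^U)^p = (\alpha^p)^U = (\alpha^p)^V = (\alpha^V)^p$. Hence
$(\alpha^U - \alpha^V)^p = 0$ and so $\alpha^U = \alpha^V$. Thus
$\alpha \in \Phi(\alpha^p)$ and hence $\alpha$ is separable.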

Necessity. Suppose that P is separable over $\Phi$. Then there exists a
primitive element $\alpha$ in P such that $P = \Phi(\alpha)$. Let (L,U,V)
be an indecomposable composite and let $\mu_1^V(\lambda)$ be the minimum
polynomial of $\alpha^U$ over the field $P^V$. Since L is an indecomposable
algebra $\mu_1^V(\lambda) = \nu^V(\lambda)^e$ where $\nu^V(\lambda)$ is
irreducible. For if $\mu_1^V(\lambda) = \psi_1^V(\lambda)\psi_2^V(\lambda)$
where $(\psi_1^V(\lambda), \psi_2^V(\lambda)) = 1$, $\psi_1^V(\alpha^U)$ is
a zero-divisor $\neq 0$ in L and since any zero-divisor of a completely
primary algebra is contained in the radical $\psi_1^V(\alpha^U)^f = 0$.
This implies that $\psi_1^V(\lambda)^f$ is divisible by
$\psi_1^V(\lambda)\psi_2^V(\lambda)$ contrary to
$(\psi_1^V(\lambda), \psi_2^V(\lambda)) = 1$. Now let $\mu(\lambda)$ be the
minimum polynomial of $\alpha$ over $\Phi$. Then
$\mu^V(\lambda) = \mu_1^V(\lambda)\mu_2^V(\lambda) = \nu^V(\lambda)^e \mu_2^V(\lambda)$.
It follows from this that $(\mu(\lambda), \mu'(\lambda)) \neq 1$ if
$e > 1$. Thus $e = 1$ and $\alpha^U$ satisfies an irreducible polynomial
over $P^V$. Hence $L = P^V(\alpha^U)$ is a field. We have therefore proved
that any indecomposable composite of P is irreducible and hence any
composite is semi-simple.
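
For contrast with the separable case just treated, the following sketch shows how semi-simplicity fails for a purely inseparable extension. It is an added illustration under explicit assumptions, not an example from the source: P is taken to be the rational function field $\mathbb{F}_p(t)$ and $\Phi = \mathbb{F}_p(t^p)$, so that $q = p$.

```latex
% Assumed setting (hypothetical example): P = F_p(t), Phi = F_p(t^p), q = p.
% The minimum polynomial of t over Phi is mu(lambda) = lambda^p - t^p, so the
% direct product K = P^S P^T of Section 10 is P^T[lambda] / (mu^T(lambda)),
% with t^S the class of lambda.
\[
  \mu^T(\lambda) = \lambda^p - (t^T)^p = (\lambda - t^T)^p
  \quad\text{in } P^T[\lambda] .
\]
% Put n = t^S - t^T. Since t^p lies in Phi, (t^p)^S = (t^p)^T, and therefore
\[
  n^p = (t^S)^p - (t^T)^p = (t^p)^S - (t^p)^T = 0 ,
  \qquad
  n \neq 0 .
\]
% K = P^T[n] with n^p = 0 has the single maximal ideal (n); it is
% indecomposable but not a field.
```

Here the Galois composite has one indecomposable component, and that component contains the non-zero nilpotent element $n$, so it is not simple, in agreement with Theorem 20.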

We now recall the definition of a hypergroup (with an identity) H as a
system in which a product of pairs equal to a subset of H is defined. If A
and B are subsets of H, we define AB to be the set of elements contained in
all products ab, a in A and b in B. It is assumed that 1) the product is
associative: the set (ab)c = a(bc), 2) there exists an identity 1 in H such
that a1 = 1a = a for all a in H and 3) for each a there is an inverse b
such that ab and ba contain 1.

Suppose now that H is a set of non-singular simple composites of P having
the following properties: 1) If $\Gamma_1$ and $\Gamma_2$ belong to H, then
$\Gamma_1 \times \Gamma_2$ is the least common cover of certain $\Gamma_j$
in H. 2) H contains the identity composite. 3) H contains the inverse
$\Gamma^{-1}$ of any $\Gamma$ in H. Then if we define $\Gamma_1\Gamma_2$ to
be the set of simple composites contained in H and covered by
$\Gamma_1 \times \Gamma_2$, it is easy to see that H is a hypergroup. We
shall therefore call a set of non-singular




simple composites with the properties 1), 2) and 3) a hypergroup of simple 
composites. 

Now let H be a finite hypergroup of simple composites and let
$\Gamma$ = (K,S,T) be the least common cover of all the $\Gamma_j$ in H. If
$(K_i, S_i, T_i)$ are the indecomposable components of $\Gamma$, it is
evident that each $\Gamma_j$ is covered by one of the $(K_i, S_i, T_i)$, no
two $\Gamma_j$ are covered by the same $(K_i, S_i, T_i)$ and no two
$(K_i, S_i, T_i)$ cover the same $\Gamma_j$. It follows that the $\Gamma_j$
coincide with the $(K_i, S_i, T_i)$. Hence by Theorem 19, (K,S,T) is a
Galois composite and since its indecomposable components are simple,
(K,S,T) is semi-simple. Let $\Phi_H$ be the field of fixed elements
relative to the $\Gamma_j$. Then $\Phi_H = \Phi_\Gamma$ and hence P is
finite and separable over $\Phi_H$. Conversely suppose that P is finite and
separable over $\Phi$ and let $\Gamma$ be the Galois composite such that
$\Phi_\Gamma = \Phi$. Then we have seen that $\Gamma$ is semi-simple and it
follows from Theorem 19 that the indecomposable components $\Gamma_j$ of
$\Gamma$ form a finite hypergroup H of simple composites. Evidently
$\Phi_H = \Phi$. We have therefore proved


THEOREM 21. Let P be a fixed field and H a finite hypergroup of simple
composites of P. Then if $\Phi_H$ denotes the field of fixed elements under
all the composites in H, P is finite and separable over $\Phi_H$. The
correspondence $H \to \Phi_H$ is (1-1) between the finite hypergroups of
simple composites of P and the subfields $\Phi_H$ over which P is finite
and separable. $H_1 \supseteq H_2$ if and only if
$\Phi_{H_1} \subseteq \Phi_{H_2}$.


13. Normal fields. We shall call a composite (L,U,V) one-dimensional if
$(L : P^V) = 1$. Evidently this implies that (L,U,V) is simple. With this
definition we have


THEOREM 22. A necessary and sufficient condition that a field P finite
over a subfield $\Phi$ be normal over $\Phi$ is that every simple composite
(L,U,V), whose field of fixed elements contains $\Phi$, is one-dimensional.


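Suppose that P is normal over $\Phi$ and that (L,U,V) is a simple composite
whose field of fixed elements contains $\Phi$. Let $\alpha$ be an element
of P and let $\mu(\lambda)$ be its minimum polynomial over $\Phi$. Since P
is normal, $\mu(\lambda) = (\lambda - \alpha_1) \cdots (\lambda - \alpha_n)$,
$\alpha_1 = \alpha$, in $P[\lambda]$. Hence
$\mu^V(\lambda) = \prod (\lambda - \alpha_i^V)$. Since
$\mu^V(\alpha^U) = 0$, $\prod (\alpha^U - \alpha_i^V) = 0$ and since L is a
field $\alpha^U = \alpha_i^V$ for a suitable i. Thus $P^U \subseteq P^V$
and $(L : P^V) = 1$.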

Next suppose that for every simple (L,U,V) whose field of fixed elements
contains $\Phi$, $(L : P^V) = 1$. Consider the Galois composite
$\Gamma$ = (K,S,T) such


9 Cf. Kaloujnine, loc. cit.




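that $\Phi_\Gamma = \Phi$. Let $K = K_1 \oplus \cdots \oplus K_s$ where the
$K_i$ are indecomposable. If N is the radical of K,
$N = N_1 \oplus \cdots \oplus N_s$, $N_i$ the radical of $K_i$. We define
the $S_i$, $T_i$ as in § 11 and consider the simple composite
$(K_i - N_i, \bar{S}_i, \bar{T}_i)$ where $\bar{S}_i$, $\bar{T}_i$ are
obtained by first applying $S_i$, $T_i$ and then the natural homomorphism
between $K_i$ and $K_i - N_i$. By our assumption for any $\alpha$ in P,
$\alpha^{\bar{S}_i} = \beta_i^{\bar{T}_i}$ for a suitable $\beta_i$. Hence
$(\alpha^{S_i} - \beta_i^{T_i})^{n_i} = 0$ for some $n_i$. We assume that
$n_i$ is minimal. Let $\delta_1, \dots, \delta_t$ be the distinct elements
among the $\beta_i$ and let $m_k$ be the largest of the $n_i$ for which
$\beta_i = \delta_k$. Consider the polynomial
$\mu^T(\lambda) = (\lambda - \delta_1^T)^{m_1} \cdots (\lambda - \delta_t^T)^{m_t}$.
We wish to prove that $\mu^T(\lambda)$ is the minimum polynomial over $P^T$
satisfied by $\alpha^S$. We note first that
$(\alpha^{S_i} - \delta_1^{T_i})^{m_1} \cdots (\alpha^{S_i} - \delta_t^{T_i})^{m_t}$
contains the factor $(\alpha^{S_i} - \beta_i^{T_i})^{n_i} = 0$. Since
(K,S,T) is the least common cover of the $(K_i, S_i, T_i)$, this implies
that $\mu^T(\alpha^S) = 0$. Now let $\nu^T(\lambda)$ be the minimum
polynomial of $\alpha^S$ over $P^T$. We recall that if
$1, \alpha, \dots, \alpha^{m-1}$ are linearly independent over $\Phi$ then
$1, \alpha^S, \dots, (\alpha^S)^{m-1}$ are linearly independent over $P^T$.
This implies that $\nu^T(\lambda)$ has coefficients in
$\Phi^T = \Phi^S$. Hence $\nu(\lambda)$ is the minimum polynomial of
$\alpha$ over $\Phi$. Since $\nu^T(\alpha^S) = 0$,
$\nu^{T_i}(\alpha^{S_i}) = 0$. This implies that $\nu^{T_i}(\lambda)$ is
divisible by $(\lambda - \beta_i^{T_i})^{n_i}$ and hence that
$\nu^T(\lambda)$ is divisible by $(\lambda - \beta_i^T)^{n_i}$. It follows
that $\nu^T(\lambda)$ is divisible by $\mu^T(\lambda)$ and since
$\nu^T(\lambda)$ is minimal, $\mu^T(\lambda) = \nu^T(\lambda)$. From the
factorization $\mu^T(\lambda) = \prod (\lambda - \delta_k^T)^{m_k}$ we
obtain $\nu(\lambda) = \prod (\lambda - \delta_k)^{m_k}$. Thus the minimum
polynomial of $\alpha$ over $\Phi$ factors into linear factors in
$P[\lambda]$ and so P is normal over $\Phi$.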
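
A non-normal extension shows the criterion of Theorem 22 failing in the expected way. The example below is an added illustration, not part of the source; it assumes $P = \mathbb{Q}(\theta)$ with $\theta = \sqrt[3]{2}$ real and $\Phi = \mathbb{Q}$, and it identifies the direct product of § 10 with a quotient of the polynomial ring $P^T[\lambda]$.

```latex
% Assumed setting (hypothetical example): theta^3 = 2, P = Q(theta), Phi = Q.
% Minimum polynomial of theta over Phi and its factorization in P[lambda]:
\[
  \mu(\lambda) = \lambda^3 - 2
  = (\lambda - \theta)\,(\lambda^2 + \theta\lambda + \theta^2) ,
\]
% where the quadratic factor has no root in the real field P and is
% therefore irreducible over P.
% The direct product K = P^S P^T = P^T[lambda] / (mu^T(lambda)) splits as
\[
  K = K_1 \oplus K_2 ,
  \qquad
  K_1 \cong P^T[\lambda]/(\lambda - \theta^T),
  \quad
  K_2 \cong P^T[\lambda]/(\lambda^2 + \theta^T\lambda + (\theta^T)^2) ,
\]
\[
  (K_1 : P^T) = 1 ,
  \qquad
  (K_2 : P^T) = 2 ,
  \qquad
  1 + 2 = 3 = (P : \Phi) .
\]
```

Both components are fields, so the Galois composite is semi-simple (P is separable over $\mathbb{Q}$), but the component $K_2$ is a simple composite that is not one-dimensional, which matches the fact that $\mathbb{Q}(\sqrt[3]{2})$ is not normal over $\mathbb{Q}$.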

Let (L,U,V) be any non-singular one-dimensional composite of P. Then
$(L : P^V) = 1$ and so $L = P^V$. Since $(L : P^U) = 1$ also, $L = P^U$. If
we apply the isomorphism $V^{-1}$ to L, we see that (L,U,V) is equivalent
to the composite (P,A,1) where $A = UV^{-1}$ is an automorphism in P and 1
is the identity automorphism. Conversely if A is any automorphism, (P,A,1)
is a non-singular composite. It is clear that (P,A,1) = $(P, 1, A^{-1})$.
Hence the inverse of (P,A,1) is (P,1,A) = $(P, A^{-1}, 1)$. It may be
verified directly from the definition of the product that
(P,A,1) × (P,B,1) = (P,A,1) × $(P, 1, B^{-1})$ = $(P, A, B^{-1})$
= (P,AB,1). Thus the totality of non-singular one-dimensional composites
of P is a group under multiplication isomorphic to the group of
automorphisms of P.
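
The product rule for one-dimensional composites can also be read off from Theorem 8 by looking at relations spaces. The short computation below is a sketch added for illustration; it applies the notation of § 8 to the composites (P,A,1) and (P,B,1), with the basis element $\alpha_1 = 1$ chosen here for convenience.

```latex
% In (P, A, 1) we have K = P, S = A, T = 1 and (K : P^T) = 1, with basis
% alpha_1^S = 1. The defining expansion alpha^S = alpha_1^S mu_1(alpha)^T reads
\[
  \alpha^S = \alpha A = 1 \cdot \mu_1(\alpha)^T ,
  \qquad\text{so}\qquad
  \mu_1(\alpha) = \alpha A , \quad M_1 = A .
\]
% Likewise N_1 = B for (P, B, 1). By Theorem 8 the relations space of the
% product is the space over P generated by the single product
\[
  M_1 N_1 = A B ,
\]
% which is the relations space of the one-dimensional composite (P, AB, 1).
```

This agrees with the direct verification (P,A,1) × (P,B,1) = (P,AB,1) and makes the isomorphism with the automorphism group visible at the level of relations spaces.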

Suppose now that H is a finite hypergroup of one-dimensional composites.
Then it is clear that H is a group and by Theorems 21 and 22, P is finite,
separable and normal over $\Phi_H$. Conversely if P is finite, separable
and normal over $\Phi$, then the indecomposable components of the Galois
composite $\Gamma$ such that $\Phi_\Gamma = \Phi$ form a group H under
multiplication. If (P,A,1) is one of




the simple components of $\Gamma$ then A is an automorphism of P that
leaves the elements of $\Phi$ fixed. On the other hand if A is such an
automorphism of P, (P,A,1) is a composite leaving the elements of $\Phi$
fixed. Hence (P,A,1) is covered by $\Gamma$ and therefore (P,A,1) is one of
the indecomposable components of $\Gamma$. This completes the proof of the
classical correspondence:


THEOREM 23. If P is an arbitrary field, there is a (1—1) correspondence
between the finite groups H of automorphisms in P and the subfields Φ_H over
which P is finite, separable and normal, namely, Φ_H is the set of fixed elements
under the automorphisms of H and H is the complete set of automorphisms
leaving the elements of Φ_H fixed.


We recall that (K:P^T) = Σ(K_i:P^{T_i}). If P is separable and normal
over Φ_Γ each (K_i:P^{T_i}) = 1 and hence (P:Φ_Γ) = (K:P^T) is the number
of components (K_i,S_i,T_i). It follows that (P:Φ_Γ) is the order of the
Galois group of P over Φ_Γ.


14. Simple extensions. Suppose that P = Φ(α) is a simple extension of
Φ and let μ(λ) be the minimum polynomial of α over Φ. Let Γ = (K,S,T)
be the Galois composite whose field of fixed elements Φ_Γ = Φ. Then the
minimum polynomial of α^S over P^T is μ^T(λ) and the degree of μ^T(λ) is the
dimensionality q of K over P^T. If Λ = (L,U,V) is any composite such
that Φ_Λ ⊇ Φ, Γ is a cover of Λ and hence L is generated by α^U over P^V. Thus
if ν^V(λ) is the minimum polynomial of α^U over P^V and r is the degree of
ν^V(λ) then the elements 1, α^U, ⋯, (α^U)^{r−1} form a basis for L over P^V.
Since μ^V(α^U) = 0, ν^V(λ) is a factor of μ^V(λ) and hence the polynomial ν(λ) in
P[λ] is a factor of μ(λ). Thus we have associated with the composite Λ the
factor ν(λ) of μ(λ) in P[λ]. The composite Λ_1 is a cover of Λ_2 if and only if
the associated factor ν_1(λ) is divisible by ν_2(λ). We shall show next that every
factor ν(λ) of μ(λ) in P[λ] arises in this way from some composite. For if
μ(λ) = ν(λ)ν_1(λ) and 𝔅 is the ideal in K generated by ν^T(α^S) then
(K − 𝔅,S,T) is the required composite. The correspondence between
composites Λ such that Φ_Λ ⊇ Φ and the factors ν(λ) of μ(λ) in P[λ] is therefore
(1—1). It is readily seen that the indecomposable components of Γ
correspond to the prime power factors π_i(λ)^{e_i} of μ(λ) = π_1(λ)^{e_1} ⋯ π_t(λ)^{e_t} in
P[λ].

Let ν(λ) = λ^r − β_1 λ^{r−1} − ⋯ − β_r be a factor of μ(λ) and let
(L,U,V) be the corresponding composite. Then if we turn L into a double
P-module by defining ξx = ξ^V x = xξ^V and xξ = ξ^U x = xξ^U, it may be verified
that the matrix of α relative to the basis 1, α^U, ⋯, (α^U)^{r−1} is




AN EXTENSION OF GALOIS THEORY TO NON-NORMAL FIELDS, 


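\[
\begin{pmatrix}
0 & 0 & \cdots & 0 & \beta_r \\
1 & 0 & \cdots & 0 & \beta_{r-1} \\
\vdots & & \ddots & & \vdots \\
0 & 0 & \cdots & 1 & \beta_1
\end{pmatrix}
\]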
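If τ(λ) = λ^s − δ_1 λ^{s−1} − ⋯ − δ_s is a second factor of μ(λ), it determines
in the same way a composite and a self-representation in which α is represented
by the matrix

\[
\begin{pmatrix}
0 & 0 & \cdots & 0 & \delta_s \\
1 & 0 & \cdots & 0 & \delta_{s-1} \\
\vdots & & \ddots & & \vdots \\
0 & 0 & \cdots & 1 & \delta_1
\end{pmatrix}
\]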
The elements 8; are polynomials in « with coefficients in ®. If we replace 
the Bi in « by the corresponding polynomials in 2°, we obtain the matrix 
a!"XG representing « in the product representation. If p(A) is the minimum 
polynomial of «#”*%, p(A) is a factor of w(A) and the composite associated 
with p(A) is the product of the composites associated with v(A) and the com- 
posite associated with 7(A). 

If we apply the theory of a single linear transformation we see that any 


self-representation «—> is decomposable into “ cyclic” self-representations 
li, where «”+ has the form (22). 


THE JOHNS HOPKINS UNIVERSITY. 




ALGEBRAS DERIVED BY NON-ASSOCIATIVE MATRIX 
MULTIPLICATION.*


By A. A. ALBERT. 


1. Introduction. There are four ways to form an n-rowed matrix whose 
determinant is the product of the determinants of two n-rowed matrices, and 
only one of these, the row by column product, is an associative product. The 
row by row, column by column, and column by row products determine algebras 
which are not associative and it has recently been suggested to the author, 
in conversation, that these algebras have applications to some problems of 
physics. 

We shall study the structure of such algebras here with particular
attention to those algebras obtained by the three non-associative products from
associative algebras ℭ such that ℭ contains the transpose x′ of every x of ℭ.
We shall indeed obtain a general structure theory not merely for such algebras
but for the case where ℭ is any algebra with an involution J. Thus our
results will include the case where J is the conjugate transpose operation.
We shall also show that there are linear spaces of real matrices closed under
row by row (or column by column) multiplication and not under row by
column multiplication, but that these row by row algebras cannot be
semi-simple.


2. Row and column algebras. An algebra ℭ over a field 𝔉 is a linear
space of finite order over 𝔉 which is closed with respect to an operation of
multiplication which is a linear transformation a → ax over 𝔉 on ℭ. We may
define other algebras whose quantities are those of ℭ but in which
multiplication is defined by different linear transformations. In particular let
A, B, C be any non-singular transformations on ℭ and define an algebra ℭ_0
whose products a·x are given in terms of products in ℭ by

(1) a·x = {(aA)(xB)}C.

Then ℭ and ℭ_0 are said to be isotopic.¹ We shall consider the subalgebras of
certain isotopes of involutorial algebras.


* Received January 29, 1943.
¹ For an equivalent definition in a slightly different form see my “Non-associative
algebras I. Fundamental concepts and isotopy,” Annals of Mathematics, vol. 43 (1942),
pp. 685-707.


An involution² J over 𝔉 of an algebra ℭ over a field 𝔉 is a linear
transformation J over 𝔉 on ℭ such that J^2 is the identity transformation and

(2) (ax)J = (xJ)(aJ)

for every a and x in ℭ. Since J is non-singular the algebras

(3) ℭ_ρ = ℭ_ρ(J), ℭ_κ = ℭ_κ(J), ℭ_κρ = ℭ_κρ(J),

defined, respectively, by the product operations given in

(4) x·y = x(yJ), (x, y) = (xJ)y, [x, y] = (xJ)(yJ)

are isotopic to ℭ.

In the special case where ℭ is the algebra of all t-rowed square matrices
and J is the operation of transposition the product x·y is the row by row
matrix product. This suggests the term row algebra (relative to ℭ and J)
for any subalgebra of ℭ_ρ(J). Similarly we shall call the subalgebras of
ℭ_κ(J) column algebras and the subalgebras of ℭ_κρ(J) column by row algebras.
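
The three products in (4) can be tried out numerically when J is transposition. The sketch below is only an illustration under that assumption; the helper names (`matmul`, `row_product`, `column_product`, `column_by_row_product`, `det2`) and the sample 2-rowed integer matrices are hypothetical choices, not taken from the paper. It checks that each product multiplies determinants and that the row by row product fails to be associative for this sample.

```python
def transpose(m):
    """Return the transpose mJ of a square matrix given as a list of rows."""
    return [list(col) for col in zip(*m)]

def matmul(x, y):
    """Ordinary (row by column) product of two square matrices."""
    n = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def row_product(x, y):
    """x . y = x(yJ): rows of x paired with rows of y."""
    return matmul(x, transpose(y))

def column_product(x, y):
    """(x, y) = (xJ)y: columns of x paired with columns of y."""
    return matmul(transpose(x), y)

def column_by_row_product(x, y):
    """[x, y] = (xJ)(yJ): columns of x paired with rows of y."""
    return matmul(transpose(x), transpose(y))

def det2(m):
    """Determinant of a 2-rowed matrix."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

x = [[1, 2], [3, 4]]
y = [[0, 1], [5, 2]]
z = [[2, 0], [1, 3]]

# Each of the four products has determinant det(x) * det(y).
for prod in (matmul, row_product, column_product, column_by_row_product):
    assert det2(prod(x, y)) == det2(x) * det2(y)

# The row by row product is not associative for this sample.
left = row_product(row_product(x, y), z)
right = row_product(x, row_product(y, z))
print(left, right, left != right)
```

A single non-associative triple is enough to show that ℭ_ρ(J) is not an associative algebra; the determinant check is a spot check on one pair, not a proof.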

A row algebra is thus a linear subspace 𝔄 of order n over 𝔉 of an algebra
ℭ such that

(5) x·y = x(yJ)

is in 𝔄 for every x and y of 𝔄. In general yJ is not in 𝔄 and 𝔄 need not
form a subalgebra of ℭ. We shall give an example of such an algebra later.
Thus (5) does not necessarily define 𝔄 as an isotope of a subalgebra of ℭ.
However J is non-singular and the mapping x → xJ of 𝔄 on another subspace
𝔄J of ℭ is one to one. If x and y are in 𝔄 we have

(x·y)J = {x(yJ)}J = {(yJ)J}(xJ) = (yJ, xJ).

It follows that 𝔄J is a column algebra and that the correspondence x → xJ
is an anti-isomorphism. We have proved


THEOREM 1. Every column algebra is anti-isomorphic to a row algebra.
In particular, every ℭ_ρ(J) is anti-isomorphic to ℭ_κ(J).


This result reduces our study to that of row algebras and of column by 
row algebras. We shall therefore omit entirely all mention of column algebras 
henceforth in our proofs. 
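
The identity behind Theorem 1 can be spot-checked for matrices, again assuming J is transposition. This is a minimal sketch with hypothetical helper names and sample matrices; it verifies (x·y)J = (yJ, xJ) and the related identity (x·y)J = y·x on one pair of 2-rowed matrices.

```python
def transpose(m):
    """Return the transpose mJ of a square matrix given as a list of rows."""
    return [list(col) for col in zip(*m)]

def matmul(x, y):
    """Ordinary (row by column) matrix product."""
    n = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def row_product(x, y):
    """x . y = x(yJ)."""
    return matmul(x, transpose(y))

def column_product(x, y):
    """(x, y) = (xJ)y."""
    return matmul(transpose(x), y)

x = [[1, 2], [3, 4]]
y = [[0, 1], [5, 2]]

image_of_product = transpose(row_product(x, y))                 # (x . y)J
reversed_column = column_product(transpose(y), transpose(x))    # (yJ, xJ)
reversed_row = row_product(y, x)                                 # y . x

print(image_of_product == reversed_column, image_of_product == reversed_row)
```

The first equality is the anti-isomorphism x → xJ between a row algebra and a column algebra; the second shows that J, viewed as a map of ℭ_ρ(J) into itself, only reverses the factors.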


² Cf. my “Involutorial simple algebras and real Riemann matrices,” Annals of
Mathematics, vol. 36 (1935), pp. 886-964.




It should also be noted that (x·y)J = y(xJ) = y·x and thus the self
correspondence x → xJ does not in general carry x·y into either xJ·yJ or
yJ·xJ. Hence J is, in general, neither an automorphism nor an
anti-automorphism³ of ℭ_ρ(J) but is merely a linear transformation carrying
products x·y into y·x. It is an anti-isomorphism of ℭ_ρ(J) and the isotopic
algebra ℭ_κ(J).


3. Algebras with a unity quantity. Let ℭ have a unity quantity e so
that e(eJ) = eJ. Every product x(xJ) is J-symmetric and thus⁴ eJ = e.
It follows that

(6) e·x = e(xJ) = xJ = (eJ)(xJ) = [e, x]

for every x of ℭ.

If 𝔅 is an ideal of ℭ_ρ or of ℭ_κρ the products of its quantities with e are
in 𝔅. But then (6) implies that if x is in 𝔅 so is xJ, 𝔅 = 𝔅J. If y is in ℭ
and x is in 𝔅 the product x·yJ = xy is in 𝔅, y·xJ = yx is in 𝔅, the space 𝔅
is an ideal of ℭ. Similarly [xJ, yJ] = xy, [yJ, xJ] = yx and if 𝔅 is an
ideal of ℭ_κρ it is an ideal of ℭ. Conversely if 𝔅 = 𝔅J is an ideal of ℭ the
products x·y, y·x, [x, y], [y, x] are in 𝔅 for every x of 𝔅 and y of ℭ, 𝔅 is
an ideal of ℭ_ρ, ℭ_κρ. We have proved


THEOREM 2. Let ℭ be a J-involutorial algebra with a unity quantity.
Then a linear subspace 𝔅 of ℭ forms an ideal of ℭ_ρ, ℭ_κ, ℭ_κρ if and only if
𝔅 = 𝔅J is an ideal of ℭ.


As an immediate consequence we have


THEOREM 3. Let ℭ be a J-involutorial simple algebra with a unity
quantity. Then the algebras ℭ_ρ(J), ℭ_κ(J), ℭ_κρ(J) are simple.⁵


³ The transformation J is an anti-automorphism of ℭ_κρ. For
[x, y]J = {(xJ)(yJ)}J = yx = {(yJ)J}{(xJ)J} = [yJ, xJ].
⁴ If ℭ is an algebra of matrices and r is the maximum rank of any quantity a of
ℭ then ea = a so that e must have rank r. See Theorem 12 for algebras which then
necessarily contain a symmetric idempotent of maximum rank.
⁵ If 𝔄 is any central simple algebra and 𝔄_0 is isotopic to 𝔄 it is simple. For
R_x^(0) = P R_{xQ}, L_x^(0) = Q L_{xP} and if e is the unity quantity of 𝔄 we have
R_f^(0) = P, L_g^(0) = Q, f = eQ^{-1}, g = eP^{-1}. Then T(𝔄_0) contains P, Q and
R_x, L_x and T(𝔄_0) contains T(𝔄). But if 𝔄 is central simple T(𝔄) is the algebra
of all linear transformations on 𝔄 and contains T(𝔄_0), T(𝔄) = T(𝔄_0), 𝔄_0 is
central simple. This result seems to have been observed first by R. H. Bruck. It
implies Theorem 3 for central simple algebras.




COROLLARY. Let ℭ be an associative J-involutorial simple algebra. Then
ℭ_ρ(J), ℭ_κ(J), ℭ_κρ(J) are simple.


4. Semi-simple algebras. The converse of Theorem 3 is not true and 
indeed we may prove 


THEOREM 4. Let ℭ = 𝔖 ⊕ 𝔖J be a J-involutorial algebra such that 𝔖
is a simple algebra with a unity quantity. Then ℭ_ρ, ℭ_κ, ℭ_κρ are simple.


For ℭ has a unity quantity and every ideal 𝔅 of ℭ_ρ, ℭ_κ or ℭ_κρ has the
property that 𝔅 = 𝔅J is an ideal of ℭ. The non-zero ideals of ℭ are 𝔖,
𝔖J, ℭ and the only one of these equal to the transformed space under J is ℭ.
Hence 𝔅 = 0, ℭ, the algebras are simple.

Consider any J-involutorial semi-simple algebra ℭ with a unity quantity.
Then ℭ = 𝔖_1 ⊕ ⋯ ⊕ 𝔖_t for simple components 𝔖_i unique apart from
their order in the sum. But ℭJ = ℭ = 𝔖_1J ⊕ ⋯ ⊕ 𝔖_tJ and each
𝔖_iJ = 𝔖_j for some j. Hence either 𝔖_i = 𝔖_iJ or 𝔖_i ⊕ 𝔖_iJ is a component of ℭ.
Thus we may write ℭ = 𝔅_1 ⊕ ⋯ ⊕ 𝔅_r for components 𝔅_j = 𝔅_jJ which are
either simple or direct sums as in Theorem 4. Then the 𝔅_j define simple
algebras (𝔅_j)_ρ, (𝔅_j)_κρ. Moreover b_ib_j = b_jb_i = 0 for every b_i in 𝔅_i and b_j
in 𝔅_j with i ≠ j. Since 𝔅_j = 𝔅_jJ we have b_i·b_j = b_j·b_i = [b_i, b_j]
= [b_j, b_i] = 0 and ℭ_ρ is the direct sum of its simple components (𝔅_j)_ρ, ℭ_κρ
is the direct sum of its simple components (𝔅_j)_κρ. We have proved that
ℭ_ρ, ℭ_κ, ℭ_κρ are semi-simple.

Conversely let 𝔄 = ℭ_ρ or ℭ_κρ be semi-simple so that by Theorem 2 if
ℭ has a unity quantity the components 𝔅_i of 𝔄 are r linear subspaces of ℭ
which are ideals 𝔅_i = 𝔅_iJ of ℭ, ℭ = 𝔅_1 ⊕ ⋯ ⊕ 𝔅_r, ℭ_ρ = (𝔅_1)_ρ ⊕ ⋯
⊕ (𝔅_r)_ρ, ℭ_κρ = (𝔅_1)_κρ ⊕ ⋯ ⊕ (𝔅_r)_κρ. The components 𝔅_i all have
unity quantities and ℭ will have been proved to be semi-simple when we have
proved the following


LEMMA. Let ℭ have a unity quantity and ℭ_ρ or ℭ_κρ be simple. Then
ℭ is semi-simple and is either simple or a direct sum 𝔖 ⊕ 𝔖J where 𝔖 is
simple.


We shall be unable to complete the proof of this lemma without a con- 
sideration of the radical of an algebra and so we pass on to this study. 


5. The radical. The radical of an algebra ℭ which is homomorphic to
a semi-simple algebra is defined to be⁶ the intersection 𝔑 of all ideals 𝔅 of ℭ


⁶ See my paper, “On the radical of a non-associative algebra,” Bulletin of the
American Mathematical Society, vol. 48 (1942), pp. 891-897.




such that ℭ − 𝔅 is semi-simple. If ℭ is not homomorphic to a semi-simple
algebra its radical 𝔑 is the intersection of all ideals 𝔅 of ℭ such that ℭ − 𝔅
is a zero algebra.

Let S be any automorphism or anti-automorphism of ℭ. Then S carries
𝔑 into a subspace 𝔑S of ℭ. Evidently 𝔑S is an ideal of ℭS = ℭ and
ℭS − 𝔑S = ℭ − 𝔑S is semi-simple or a zero algebra according as ℭ − 𝔑 is
semi-simple or a zero algebra. By the definition of 𝔑 the set 𝔑S contains 𝔑.
The order of 𝔑S cannot be greater than that of 𝔑, 𝔑S = 𝔑. The
correspondence S induces a correspondence S_0:

[x] → [x]S_0 = [xS]

in ℭ − 𝔑. If [xS] = [yS] then [xS − yS] = 0, (x − y)S is in 𝔑 so is
x − y, [x] = [y]. Hence S_0 is a one to one correspondence. Thus it is
clear that S_0 is an automorphism, anti-automorphism, or involution of ℭ − 𝔑
according as S is an automorphism, anti-automorphism, or involution of ℭ.
We shall use this result only in the case where S = J is an involution and
shall prove


THEOREM 5. Let 𝔑 ≠ 0 be the radical of a J-involutorial algebra ℭ.
Then 𝔑 = 𝔑J, the algebras 𝔑_ρ, 𝔑_κ, 𝔑_κρ are ideals of ℭ_ρ, ℭ_κ, ℭ_κρ respectively
such that

(7) (ℭ − 𝔑)_ρ = ℭ_ρ − 𝔑_ρ, (ℭ − 𝔑)_κ = ℭ_κ − 𝔑_κ, (ℭ − 𝔑)_κρ = ℭ_κρ − 𝔑_κρ.

Moreover if ℭ − 𝔑 has a unity quantity the algebras ℭ_ρ − 𝔑_ρ, ℭ_κ − 𝔑_κ,
ℭ_κρ − 𝔑_κρ are semi-simple; if ℭ − 𝔑 is a zero algebra these algebras are zero
algebras.


For we have seen that 𝔑 = 𝔑J, the algebras 𝔑_ρ, 𝔑_κρ are defined. By
the proof of Theorem 2 they are ideals of ℭ_ρ, ℭ_κρ respectively. The linear
spaces ℭ, ℭ_ρ, ℭ_κρ coincide and so do 𝔑, 𝔑_ρ, 𝔑_κρ. Thus the difference groups
ℭ − 𝔑, ℭ_ρ − 𝔑_ρ, ℭ_κρ − 𝔑_κρ are the same spaces. But if we define [x]J_0
= [xJ] then multiplication in (ℭ − 𝔑)_ρ is defined by [x]·[y] = [x]{[y]J_0}
= [x][yJ] = [x(yJ)] = [x·y] and this is the product in ℭ_ρ − 𝔑_ρ. Similarly
the product in (ℭ − 𝔑)_κρ is that in ℭ_κρ − 𝔑_κρ. This gives (7). If
ℭ − 𝔑 has a unity quantity it cannot be a zero algebra and is semi-simple.
Then (ℭ − 𝔑)_ρ and (ℭ − 𝔑)_κρ have already been found to be semi-simple,
our result follows from (7). Finally if ℭ − 𝔑 is a zero algebra the products
[x]·[y] = [x(yJ)] = [x][yJ] = 0. Hence ℭ_ρ − 𝔑_ρ and similarly
ℭ_κρ − 𝔑_κρ are zero algebras.




We may now prove our lemma. Let ℭ_ρ or ℭ_κρ be simple and 𝔑 be the
radical of ℭ. By Theorem 5 we have 𝔑 = 𝔑J, 𝔑_ρ is an ideal of ℭ_ρ, 𝔑_κρ is
an ideal of ℭ_κρ. But 𝔑 is a proper ideal of ℭ, the only proper ideal of a
simple algebra is zero. Thus 𝔑 = 0, ℭ has a unity quantity and is not a
zero algebra, ℭ is semi-simple. The simple components of ℭ define simple
components 𝔖 = 𝔖J of ℭ_ρ and ℭ_κρ or simple components 𝔖 ⊕ 𝔖J of ℭ_ρ and
ℭ_κρ. But then ℭ = 𝔖 or 𝔖 ⊕ 𝔖J. We have proved the first part of


THEOREM 6. Let ℭ be a J-involutorial algebra with a unity quantity.
Then ℭ is semi-simple if and only if the algebras ℭ_ρ, ℭ_κ, ℭ_κρ are all
semi-simple. If ℭ is not semi-simple and 𝔑 is its radical the algebras 𝔑_ρ, 𝔑_κ,
𝔑_κρ are the respective radicals of ℭ_ρ, ℭ_κ, ℭ_κρ and the difference algebras
ℭ − 𝔑, ℭ_ρ − 𝔑_ρ, ℭ_κ − 𝔑_κ, ℭ_κρ − 𝔑_κρ are all semi-simple.


To prove the last part of this theorem we note that ℭ is homomorphic
to ℭ − 𝔑 and thus ℭ − 𝔑 has a unity quantity, the algebras ℭ_ρ − 𝔑_ρ,
ℭ_κρ − 𝔑_κρ are semi-simple. Every ideal of ℭ_ρ is an algebra 𝔅_ρ where 𝔅 is
an ideal of ℭ. If 𝔅_ρ is the radical of ℭ_ρ it is contained in 𝔑_ρ and ℭ_ρ − 𝔅_ρ is
semi-simple. As in the proof of Theorem 5 we have ℭ_ρ − 𝔅_ρ = (ℭ − 𝔅)_ρ
and by the first part of our theorem ℭ − 𝔅 is semi-simple, 𝔅 contains 𝔑.
Hence 𝔅 = 𝔑. Similarly 𝔑_κρ is the radical of ℭ_κρ.

In the case where ℭ is associative we may derive these results without the
hypothesis that ℭ has a unity quantity. We first have the


LEMMA. Let ℭ be any nilpotent algebra. Then ℭ_ρ, ℭ_κ, ℭ_κρ are
nilpotent.


For every product of r factors in either ℭ_ρ or ℭ_κρ may be expressed as a
product of r factors in ℭ and is zero if ℭ has this property.
We next prove


THEOREM 7. Let ℭ be a J-involutorial associative algebra. Then ℭ is
semi-simple if and only if ℭ_ρ, ℭ_κ, ℭ_κρ are all semi-simple. If 𝔑 ≠ 0 is the
radical of ℭ we have 𝔑 = 𝔑J and 𝔑_ρ, 𝔑_κ, 𝔑_κρ are nilpotent and are the
radicals of ℭ_ρ, ℭ_κ, ℭ_κρ respectively, the algebras ℭ − 𝔑, ℭ_ρ − 𝔑_ρ, ℭ_κ − 𝔑_κ,
ℭ_κρ − 𝔑_κρ are semi-simple.


For if ℭ is semi-simple it has a unity quantity and ℭ_ρ and ℭ_κρ are
semi-simple. Conversely let 𝔑 be the radical of ℭ. Then we have seen that 𝔑 = 𝔑J
and that 𝔑_ρ contains the radical of ℭ_ρ and 𝔑_κρ the radical of ℭ_κρ. For ℭ − 𝔑 is
a semi-simple algebra with a unity quantity, ℭ_ρ − 𝔑_ρ and ℭ_κρ − 𝔑_κρ are
semi-simple. If ℭ_ρ or ℭ_κρ is semi-simple its ideals are semi-simple whereas




𝔑_ρ, 𝔑_κρ are zero or nilpotent ideals. Hence 𝔑 = 0, ℭ is semi-simple. If
neither ℭ, ℭ_ρ nor ℭ_κρ is semi-simple and 𝔅 is the radical of ℭ_ρ then
ℭ_ρ − 𝔑_ρ is semi-simple, 𝔅 is contained in 𝔑_ρ, ℭ_ρ − 𝔅 is semi-simple. But
𝔑_ρ − 𝔅 is zero or a nilpotent ideal of the semi-simple algebra ℭ_ρ − 𝔅,
𝔑_ρ − 𝔅 = 0, 𝔅 = 𝔑_ρ. Similarly 𝔑_κρ is the radical of ℭ_κρ.

It should be noted that the results of this theorem depend only on the
property that the radical of ℭ is nilpotent and that when ℭ is semi-simple
it has a unity quantity. Hence they hold also under the weaker hypothesis
that ℭ is an alternative algebra rather than an associative algebra.


6. Row algebras. We shall begin our study of subalgebras of ℭ_ρ by
exhibiting certain revealing examples of such algebras. We first suppose that
𝔊 is any algebra whatever and a → aS is an anti-isomorphism of 𝔊 on an
algebra which we shall designate by 𝔊S. Construct the direct sum
ℭ = 𝔊 ⊕ 𝔊S and define (c_1 + c_2S)J = c_2 + c_1S for every c_1 of 𝔊 and c_2S of
𝔊S. Then J is an involution of ℭ. However c_1·c_2 = c_1(c_2S) = 0 for every
c_1 and c_2 of 𝔊. It follows that 𝔊 forms a subalgebra 𝔄 of ℭ_ρ which is a
zero algebra. However 𝔊 ≠ 𝔊J, 𝔄 ≠ 𝔅_ρ for a subalgebra 𝔅 of ℭ.
Nevertheless the space 𝔊 is a subalgebra of ℭ.

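We next suppose that ℭ is the set of all 2n-rowed square matrices and
that J is the transformation x → xJ = x′ of transposition. We let 𝔐 be the
set of all matrices of the form

\[ a = \begin{pmatrix} A & 0 \\ 0 & 0 \end{pmatrix} \]

for A an n-rowed square matrix, and

\[ e = \begin{pmatrix} 0 & I \\ 0 & 0 \end{pmatrix}, \qquad eJ = \begin{pmatrix} 0 & 0 \\ I & 0 \end{pmatrix}, \]

where I is the identity matrix of n rows. Then 𝔐 is a subalgebra of ℭ, 𝔐 is
simple, f = e·e = e(eJ) is the unity quantity of 𝔐. Also

\[ e \cdot a = \begin{pmatrix} 0 & I \\ 0 & 0 \end{pmatrix} \begin{pmatrix} A' & 0 \\ 0 & 0 \end{pmatrix} = \begin{pmatrix} 0 & 0 \\ 0 & 0 \end{pmatrix}. \]

We define 𝔄 as the linear subspace

𝔄 = 𝔐 + e𝔉

of ℭ. Then 𝔄 has order n^2 + 1 and since 𝔐 = 𝔐J and e ≠ eJ the set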




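𝔄 ≠ 𝔄J = 𝔐 + (eJ)𝔉. Moreover if a and b are in 𝔐 and α and β are in 𝔉
we have

(a + αe)·(b + βe) = a·b + αβf

is in 𝔐. It follows that 𝔄 is a row algebra. However if A is a non-scalar
matrix the product

\[ \begin{pmatrix} A & 0 \\ 0 & 0 \end{pmatrix} \begin{pmatrix} 0 & I \\ 0 & 0 \end{pmatrix} = \begin{pmatrix} 0 & A \\ 0 & 0 \end{pmatrix} \]

is not in 𝔄 and 𝔄 is not a subalgebra of ℭ, 𝔄 ≠ 𝔇_ρ for any subalgebra
𝔇 = 𝔇J of ℭ.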
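
The example can be checked by direct computation for n = 2. The following is a sketch only: the block layout of 𝔐 and e is read off from the (partly damaged) displays above, and all function names (`top_blocks`, `in_A`, and so on) and the sample integer entries are hypothetical. It confirms that the space 𝔐 + e𝔉 is closed under the row by row product on one sample pair, but that an ordinary product of two of its members can leave the space.

```python
def zeros(rows, cols):
    return [[0] * cols for _ in range(rows)]

def identity(n):
    return [[int(i == j) for j in range(n)] for i in range(n)]

def transpose(m):
    return [list(col) for col in zip(*m)]

def matmul(x, y):
    """Ordinary (row by column) product."""
    return [[sum(x[i][k] * y[k][j] for k in range(len(y)))
             for j in range(len(y[0]))] for i in range(len(x))]

def row_product(x, y):
    """x . y = x(yJ) with J transposition."""
    return matmul(x, transpose(y))

def add(x, y):
    return [[p + q for p, q in zip(r, s)] for r, s in zip(x, y)]

def scale(c, x):
    return [[c * v for v in row] for row in x]

def top_blocks(left, right, n):
    """2n-rowed matrix with n-rowed blocks left, right on top and zero rows below."""
    return [left[i] + right[i] for i in range(n)] + zeros(n, 2 * n)

def in_A(m, n):
    """Membership in M + eF: zero bottom rows, scalar top-right block."""
    if any(v != 0 for row in m[n:] for v in row):
        return False
    corner = [row[n:] for row in m[:n]]
    return corner == scale(corner[0][0], identity(n))

n = 2
A = [[1, 2], [3, 4]]          # a non-scalar block
B = [[0, 1], [1, 5]]
a = top_blocks(A, zeros(n, n), n)
b = top_blocks(B, zeros(n, n), n)
e = top_blocks(zeros(n, n), identity(n), n)
f = row_product(e, e)         # f = e . e, the unity quantity of M

x = add(a, scale(2, e))       # a + 2e
y = add(b, scale(3, e))       # b + 3e
closed = row_product(x, y) == add(row_product(a, b), scale(6, f))

print(closed, in_A(row_product(x, y), n), in_A(matmul(a, e), n))
```

The last value printed is the point of the example: the ordinary product a e carries the non-scalar block A into the top-right corner, which is not of the form αe.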

The subspace 𝔐 defines a subalgebra 𝔐_ρ of 𝔄. Also (𝔐_ρ)^{·2} = 𝔐_ρ
since 𝔐 is simple. But 𝔄^{·2} ⊆ 𝔐_ρ, 𝔄^{·2} = 𝔐_ρ. Let 𝔅 be any other
non-zero ideal of 𝔄. Then 𝔅 ⊆ 𝔐_ρ implies that 𝔅 = 𝔐_ρ. Hence 𝔅 contains
a quantity a + αe with a in 𝔐 and α ≠ 0. But (a + αe)·(α^{-1}e) = f is in
𝔅 and 𝔅 contains 𝔐, 𝔅 contains (a + αe) − a = αe, 𝔅 = 𝔄. Hence 𝔄 and
𝔐_ρ are the only non-zero ideals of 𝔄. Since 𝔄 is not semi-simple its radical
is 𝔐_ρ = 𝔄^{·2}, 𝔄 − 𝔐_ρ = 𝔄 − 𝔄^{·2} is a zero algebra.

Our final algebra is the direct sum 𝔅 = 𝔖 ⊕ 𝔄 where 𝔄 is the algebra
above and 𝔖 = (𝔐_0)_ρ for a total matric algebra 𝔐_0. Thus 𝔅 consists of all
3n-rowed square matrices

\[ \begin{pmatrix} z & 0 \\ 0 & a \end{pmatrix} \]

for z in 𝔐_0 and a in 𝔄. As before 𝔖 = 𝔖J, 𝔄 ≠ 𝔄J, 𝔅 ≠ 𝔇_ρ for any
subalgebra 𝔇 = 𝔇J of the set ℭ of all 3n-rowed square matrices. But 𝔅 is a
row algebra.

The ideal 𝔐_ρ of 𝔄 is one ideal of 𝔅, 𝔄 is an ideal of 𝔅, 𝔖 is an ideal
of 𝔅. These are the only ideals of 𝔅 contained in either 𝔄 or 𝔖. Let 𝔏
be any ideal of 𝔅 not contained in 𝔄 or in 𝔖. Then 𝔏 contains a quantity
z + a where z ≠ 0 is in 𝔖 and a ≠ 0 is in 𝔄. But if g is the unity quantity
of 𝔐_0 we have (z + a)·g = z ≠ 0 is in 𝔐_0. Now this
product is in 𝔏 and so is y·[xJ·z] = y[(xJ)(zJ)]J = yzx for every y and
x of 𝔐_0. Since 𝔐_0 is simple 𝔏 contains 𝔐_0. Hence 𝔏 = 𝔖 ⊕ 𝔏_0 where
𝔏_0 is a subspace of 𝔄. Clearly 𝔏_0 is an ideal of 𝔄, 𝔏_0 = 𝔄 or 𝔐_ρ,
𝔏 = 𝔖 + 𝔐_ρ or 𝔅.

The algebra 𝔅 is homomorphic to 𝔅 − (𝔖 + 𝔐_ρ) = 𝔄 − 𝔐_ρ and is not
semi-simple. The only other difference algebras to which 𝔅 is homomorphic
are 𝔅 − 𝔖 = 𝔄, 𝔅 − 𝔄 = 𝔖, 𝔅 − 𝔐_ρ = 𝔖 ⊕ (𝔄 − 𝔐_ρ) and of these only
𝔅 − 𝔄 is semi-simple. Hence 𝔄 is the radical of 𝔅. It follows that the
radical 𝔑 of a row algebra 𝔅 need not have the property 𝔑 = 𝔑J.




We have now seen that if 𝔄 is a row algebra neither 𝔄 nor its radical
need have the property 𝔄 = 𝔄J. Thus this property is a consequence only of
additional hypotheses. It is a most important property since it implies that
x·yJ = xy is in 𝔄, 𝔄 = 𝔇_ρ for a subalgebra 𝔇 of ℭ. Let us now prove


THEOREM 8. If 𝔄 is a row algebra its ideal 𝔄^{·2} = (𝔄^{·2})J and has the
property 𝔄^{·2} = 𝔇_ρ for a subalgebra 𝔇 of ℭ.


For (x·y)J = y·x is in 𝔄^{·2} for every x and y of 𝔄, (𝔄^{·2})J is contained
in 𝔄^{·2}. Since J^2 is the identity we have 𝔄^{·2} = (𝔄^{·2})J.
We have the immediate consequences


COROLLARY I. Let 𝔄 = 𝔄^{·2} be a row algebra. Then 𝔄 = 𝔄J = 𝔇_ρ for
a subalgebra 𝔇 of ℭ.

COROLLARY II. Every semi-simple row algebra 𝔄 is an algebra 𝔇_ρ where
𝔇 is a semi-simple subalgebra of ℭ and 𝔄 is isotopic to 𝔇.

COROLLARY III. Let ℭ be associative and 𝔄 be a semi-simple row algebra.
Then 𝔄 is isotopic to an associative semi-simple algebra 𝔇, 𝔄 = 𝔇_ρ.


For row algebras in which 𝔄 may not be equal to 𝔄^{·2} we may prove


THEOREM 9. Let a row algebra 𝔄 be homomorphic to some semi-simple
algebra, 𝔑 be the radical of 𝔄, 𝔊 be the set of all quantities x of 𝔄 such that
xJ is in 𝔄, 𝔑_0 be the intersection of 𝔑 and 𝔊 so that 𝔑 = 𝔑_0 + 𝔑_1. Then

𝔄 = 𝔊 + 𝔑_1,

𝔊 is an ideal of 𝔄 containing 𝔄^{·2}, 𝔄 − 𝔊 is a zero algebra. Moreover 𝔑_0 is
an ideal of 𝔄 and of 𝔊 such that

𝔄 − 𝔑 ≅ 𝔊 − 𝔑_0.


For by Theorem 8 we have 𝔄^{·2} = (𝔄^{·2})J, 𝔊 contains 𝔄^{·2} and thus 𝔄·𝔊
and 𝔊·𝔄. Hence 𝔊 is an ideal of 𝔄 and 𝔄 − 𝔊 is a zero algebra. Since both
𝔑 and 𝔊 contain 𝔄·𝔑_0 and 𝔑_0·𝔄 their intersection 𝔑_0 contains 𝔄·𝔑_0 and 𝔑_0·𝔄,
𝔑_0 is an ideal of 𝔄 and of 𝔊. The algebra 𝔄 − 𝔑 is semi-simple, 𝔄 − 𝔑
= (𝔄 − 𝔑)^{·2} is an algebra whose quantities are sums of products [a]·[x]
= [a·x] = [b] where b is in 𝔊. But then every quantity of 𝔄 has the form
b + f where b is in 𝔊 and f is in 𝔑, we may take f in 𝔑_1, 𝔄 = 𝔊 + 𝔑_1. If
[b] = [b_1] where b and b_1 are in 𝔊 then b − b_1 is in 𝔑 and in 𝔊, b − b_1 is




in 𝔑_0. Thus the mapping [b] → {b} of the classes [b] of 𝔄 − 𝔑 on the classes
{b} of 𝔊 − 𝔑_0 is one to one and defines an equivalence of these algebras.

In the example we have given of such an algebra the algebra 𝔊 is
semi-simple but 𝔑_0 is not zero. Thus 𝔑_0 contains the radical of 𝔊 and may contain
it properly.

In closing this part of our discussion let us observe that if a is the n-rowed
square matrix with unity in the first row and second column and zeros
elsewhere and if J is transposition then [a, a] = (aJ)(aJ) = 0. Then a spans
a linear space 𝔄 of order one such that 𝔄 ≠ 𝔄J, 𝔄 is a column by row algebra
and is actually a zero algebra. We also note that if 𝔄 is any column by row
algebra then the set 𝔅 of all quantities x of 𝔄 such that xJ is in 𝔄 is a
subalgebra of 𝔄, 𝔅 = 𝔇_κρ. For if x and y are in 𝔅 so are xJ and yJ, [x, y]J
= [yJ, xJ] is in 𝔄, [x, y] is in 𝔅. However it does not seem likely that 𝔅
is an ideal of 𝔄 nor that any results like those derived above for row algebras
are obtainable.


7. Real algebras. A field 𝔉 is said to be real if no sum of a finite
number of non-zero squares of its quantities is zero. If A is any matrix and
A′ is its transpose the i-th diagonal element of B = AA′ is the sum of the
squares of the quantities in the i-th row of A. If A has rank r the symmetric
matrix B has rank at most r and is congruent to a diagonal matrix PBP′
= (PA)(PA)′ where P is non-singular. But PA has rank r and at least
r non-zero rows, PBP′ has at least r non-zero diagonal elements. Thus the
rank of PBP′ and of B is at least r, AA′ has rank r. It follows very simply
that a field 𝔉 is real if and only if the rank of every matrix A coincides with
the rank of AA′. We now note the known⁷


LEMMA. Let ℭ be an algebra over a real field 𝔉 of n-rowed square matrices
under ordinary matrix multiplication and let the transpose xJ of every matrix
x of ℭ be in ℭ. Then ℭ is a semi-simple algebra, ℭ = 𝔖_1 ⊕ ⋯ ⊕ 𝔖_r for
simple algebras 𝔖_i = 𝔖_iJ.


For we have seen that the radical 𝔑 of the associative algebra ℭ has the
property 𝔑 = 𝔑J. If 𝔑 ≠ 0 there is a non-zero matrix x of rank r in 𝔑.
Then y = x(xJ) is in 𝔑 and is a nilpotent symmetric matrix of rank r, y^t = 0.
But y^2 = yy′ has rank r, y^{2^k} has rank r for every k, k may be chosen so that
2^k > t, a contradiction. Hence 𝔑 = 0, ℭ is semi-simple. If e_i is the unity


⁷ Cf. page 283 of H. Weyl, “On the use of indeterminates in the theory of the
orthogonal and symplectic groups,” American Journal of Mathematics, vol. 63 (1941),
pp. 777-784. We shall give a brief proof here of this result for the sake of completeness.




quantity of 𝔖_i then e_iJ is the unity quantity of 𝔖_iJ and we have already
seen that if 𝔖_iJ ≠ 𝔖_i then 𝔖_iJ is in the direct sum relation to 𝔖_i. But then
e_i(e_iJ) = 0 which is impossible. Hence 𝔖_i = 𝔖_iJ.
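
The rank criterion used in this section can be illustrated numerically. The sketch below is an assumption-laden illustration, not part of the proof: it takes the rationals as a sample real field and the integers modulo 2 as a sample non-real field (there 1^2 + 1^2 = 0), and the helper names `rank` and `gram` are hypothetical. It computes ranks by Gaussian elimination and compares the rank of A with that of AA′.

```python
from fractions import Fraction

def rank(rows, p=None):
    """Rank by Gaussian elimination over the rationals (p=None)
    or over the integers modulo a prime p."""
    if p is None:
        m = [[Fraction(v) for v in row] for row in rows]
        inverse = lambda v: 1 / v
        reduce_ = lambda v: v
    else:
        m = [[v % p for v in row] for row in rows]
        inverse = lambda v: pow(v, p - 2, p)   # Fermat inverse, p prime
        reduce_ = lambda v: v % p
    r = 0
    for c in range(len(m[0])):
        pivot = next((i for i in range(r, len(m)) if m[i][c] != 0), None)
        if pivot is None:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        s = inverse(m[r][c])
        m[r] = [reduce_(v * s) for v in m[r]]
        for i in range(len(m)):
            if i != r and m[i][c] != 0:
                factor = m[i][c]
                m[i] = [reduce_(u - factor * v) for u, v in zip(m[i], m[r])]
        r += 1
    return r

def gram(a):
    """Return AA', the row by row product of A with itself."""
    return [[sum(u * v for u, v in zip(r1, r2)) for r2 in a] for r1 in a]

A = [[1, 2, 3], [2, 4, 6], [1, 0, 1]]          # rank 2 over the rationals
print(rank(A), rank(gram(A)))                   # ranks agree over a real field

C = [[1, 1]]                                    # over the integers mod 2
print(rank(C, p=2), rank(gram(C), p=2))         # the rank drops: 1 + 1 = 0
```

The modulo 2 case shows why the hypothesis that 𝔉 is real cannot be dropped: a non-zero row can have a vanishing sum of squares.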

Our final result will be


THEOREM 10. Let 𝔄 be a linear space of real matrices forming an
algebra under row by row multiplication such that 𝔄^{·2} = 𝔄. Then 𝔄 contains
the transpose xJ of every x of 𝔄, 𝔄 is semi-simple, 𝔄 = 𝔅_ρ is the isotope of
an associative semi-simple algebra 𝔅 = 𝔖_1 ⊕ ⋯ ⊕ 𝔖_r for simple
components 𝔖_i = 𝔖_iJ such that 𝔄 = (𝔖_1)_ρ ⊕ ⋯ ⊕ (𝔖_r)_ρ.

This result follows from the fact that necessarily 𝔄 = 𝔅_ρ and that if
𝔅 = 𝔅J then 𝔅 is semi-simple. It gives a complete construction of all row
algebras 𝔄 = 𝔄^{·2} as well as of the ideal 𝔊 of Theorem 9 which is now seen
to be semi-simple in case 𝔉 is real and J is transposition.


THE UNIVERSITY OF CHICAGO. 




ON THE FORMS OF THE PREDICATES IN THE THEORY OF 
CONSTRUCTIVE ORDINALS.* 


By S. C. KLEENE.


In the system S_3 of notation for ordinal numbers, the class O of the
natural numbers which represent ordinals, and the partial ordering relation
<_O between such numbers, were defined by a transfinite induction.¹ In this
paper, we shall prove that the predicates a ∈ O and a <_O b are expressible
explicitly in the respective forms (x)(Ey)R(a, x, y) and (x)(Ey)S(a, b, x, y)
where R and S are primitive recursive predicates. The result is used elsewhere
to exhibit the incompleteness of ordinal logics under a general theorem on
recursive predicates and quantifiers.² The proof illustrates a technique to
which recourse may be had generally in attempts to reduce inductive definitions
to explicit ones. Some simpler applications of the technique are given first,
as well as a résumé of requisite notions and results.


1. Recursive definition. We shall deal with number-theoretic functions, 
the independent variables of which range over the natural numbers, with the 
values of the functions being taken from the same domain; and with number- 
theoretic predicates, that is, propositional functions of natural numbers. 

Such functions and predicates can be defined in various ways. It often 
happens that the definition which is given for a function provides a uniform 
method which would enable one, given any set of arguments, to ascertain the 
corresponding value of the function in a finite number of steps. Likewise, 
the definition of a predicate may provide effective means for reaching a de- 
cision respecting the truth or falsity of the proposition taken as value of the 
predicate for a given set of arguments. Under these circumstances, we say 
that the function or predicate is constructively defined. 

We shall refer to primitive recursive functions and predicates, and to 
general recursive functions and predicates. These are the functions and 
predicates definable by two particular types of constructive definition which 


* Received October 15, 1942; Preliminary report presented to the American Mathe- 
matical Society, April 3, 1942. Theorems 1 and 2 were obtained in 1939-40 in progress 
of research at the Institute for Advanced Study supported by the Institute and the 
Alumni Research Foundation of the University of Wisconsin. The bracketed numbers 
refer to the bibliography at the end. 

¹ [9] p. 155. ² [10] §§ 5, 15.



42 S. C. KLEENE. 


have been described elsewhere, the second of which includes the first.³ From
work of Church and Turing, it appears that any function or predicate which
is constructively definable is general recursive.⁴ This lends the general
recursive functions their chief interest, while moreover the cursory reader may
use it as a principle in verifying our statements that certain functions and
predicates are general recursive. The additional fact that some of them are
primitive recursive is incidental for this paper.


2. Explicit definition. After some predicates and functions have been 
defined, then other predicates can be defined explicitly by giving for them 
expressions of finite length built up, with the use of variable natural numbers, 
in terms of previously defined predicates and functions, and the operations 
of logic. 

The operations of logic which we shall consider are the propositional
connectives

∨ (or), & (and), ¬ (not), → (implies),

and the quantifiers,

(Ex) (there exists an x such that), (x) (for all x).


The predicates which can be introduced by explicit definitions, when we use 
these operations of logic and start from the general recursive predicates and 
functions as given, including the recursive predicates themselves, the author 
has called elementary. 

Explicit definition may also be considered under other restrictions as to 
the terms. 


3. Reduction. It may happen that two distinct definitions introduce 
predicates which can then be shown to be equivalent. If extensional termi- 
nology is used, the definitions are recognized as two definitions of the same 
predicate; or, in the language of conditions, either one becomes a necessary 
and sufficient condition for the other. 

In the case that the one predicate is expressed explicitly in a certain form 
or in certain terms only, we shall speak of this relationship as a reduction of 
the other predicate to that form or to those terms; and the other is said to be 
expressible in that form or in those terms. 

For example, it is known that every elementary predicate is expressible
in terms of the functions + (plus), · (times) and the predicate = (equals),


³ [5] §§ 2, 9, [7] pp. 729-31, [10] §§ 1, 2. ⁴ [10] § 22.




and the operations of logic.⁵ One can thus get along with these three very
simple constructive functions and predicates, at the cost of increasing the
number of operations of logical combination which will be required to express
given predicates in terms of them.

After some preliminaries, we shall describe in § 6 a reduction of opposite
tendency.


4. Advancement of quantifiers. The following equivalences, where A 
is independent of x, hold in the classical logic. 


(1) (Bx) A(«)\V (Ex) = (Bx) (A(2)V B(a)), 
(2) (Ex) B(x) = (Ex) (AV B(2)), 
(3) A & (Hx) B(x) = (Fr) (A & B(z)), 
(4*) (Ex) A(x) = (x)A(z), 

(5) & (x) B(x) = (x) (A(z) &B(z)), 
(6) A & (x) B(x) = (x) (A &B(z)), 

(7*) B(x) = (x) (AV B(z)), 
(8*) A(x) = (He) A(z). 


By applying them from left to right, using as required the associative and
commutative laws for $\lor$ and $\&$, we can advance quantifiers in an expression
across the connectives $\lor$, $\&$ and $\overline{\ \ }$ (negation) toward the
front or exterior of the expression. With the classical equivalence


(9*)   $A \rightarrow B \equiv \overline{A} \lor B$,


we can do the same for $\rightarrow$. For example, using (9*) with (2) and (7*),
respectively, $A$ being independent of $x$,


(10*)  $A \rightarrow (Ex)B(x) \equiv (Ex)(A \rightarrow B(x))$,
(11*)  $A \rightarrow (x)B(x) \equiv (x)(A \rightarrow B(x))$.
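
The classical validity of the advancement laws can be illustrated by brute force over a small finite domain. The following is only a sketch under stated assumptions: the three-element domain and the helper names `ex`, `al`, `implies`, `all_predicates` and `laws_hold` are ours, and a finite check of this kind says nothing about intuitionistic validity.

```python
from itertools import product

DOMAIN = (0, 1, 2)  # assumed small nonempty domain; classical truth values only


def ex(pred):
    """(Ex) pred(x): existential quantifier over DOMAIN."""
    return any(pred(x) for x in DOMAIN)


def al(pred):
    """(x) pred(x): universal quantifier over DOMAIN."""
    return all(pred(x) for x in DOMAIN)


def implies(p, q):
    """Classical implication, as in (9*)."""
    return (not p) or q


def all_predicates():
    """Yield every one-place predicate on DOMAIN as a lookup function."""
    for values in product((False, True), repeat=len(DOMAIN)):
        table = dict(zip(DOMAIN, values))
        yield table.__getitem__


def laws_hold():
    """Check the laws listed in the comments for every choice of A and B."""
    for A in (False, True):  # A is independent of x
        for B in all_predicates():
            checks = [
                (A or ex(B)) == ex(lambda x: A or B(x)),              # (2)
                (A and ex(B)) == ex(lambda x: A and B(x)),            # (3)
                (not ex(B)) == al(lambda x: not B(x)),                # (4*)
                (A and al(B)) == al(lambda x: A and B(x)),            # (6)
                (A or al(B)) == al(lambda x: A or B(x)),              # (7*)
                (not al(B)) == ex(lambda x: not B(x)),                # (8*)
                implies(A, ex(B)) == ex(lambda x: implies(A, B(x))),  # (10*)
                implies(A, al(B)) == al(lambda x: implies(A, B(x))),  # (11*)
            ]
            if not all(checks):
                return False
    for P in all_predicates():
        for Q in all_predicates():
            if (ex(P) or ex(Q)) != ex(lambda x: P(x) or Q(x)):    # (1)
                return False
            if (al(P) and al(Q)) != al(lambda x: P(x) and Q(x)):  # (5)
                return False
    return True
```

Calling `laws_hold()` exercises each law on every predicate over the assumed domain; the nonemptiness of the domain matters for the laws in which $A$ is independent of $x$.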


In the intuitionistic logic, in which the classical law of the excluded
middle $A \lor \overline{A}$ is not postulated, the starred equivalences fail to
hold in general. However, under special conditions various ones of them hold
intuitionistically, as a consequence of the fact that, when $A$ is known to be
general recursive, then $A \lor \overline{A}$ can be proved. To begin with, (7*)
and (9*) hold intuitionistically, if $A$ is recursive; and hence (10*) and (11*)
do. Again, the following form of (7*), $A$ and $C$ being independent of $x$,


(12*)  $(A \& C) \lor (x)B(x) \equiv (x)((A \& C) \lor B(x))$,


5 [4] pp. 191-3, [6] pp. 412-21, [8]. 




holds intuitionistically, if $A$ is recursive and $A$ and $B(x)$ are mutually
exclusive.


5. Contraction of quantifiers. Let $(x)_i$ denote the exponent of the $i$-th
prime in the representation of $x$ as a product of powers of distinct prime
numbers, for $x$ and $i$ positive.⁶ This exponent is 0, if the $i$-th prime does
not divide $x$. For $x = 0$ or $i = 0$, let $(x)_i$ be 0. Then $(x)_i$ is in fact
a primitive recursive function of $x$ and $i$. Moreover,

(13)  $x \neq 0 \rightarrow (x)_i < x$.


Consider the ordered set of $n$ quantities $(x)_1, \ldots, (x)_n$. This ranges
with repetitions over all $n$-tuples of natural numbers as $x$ ranges over all
natural numbers. Therefore

(14)  $(Ex_1) \cdots (Ex_n)A(x_1, \ldots, x_n) \equiv (Ex)A((x)_1, \ldots, (x)_n)$,
(15)  $(x_1) \cdots (x_n)A(x_1, \ldots, x_n) \equiv (x)A((x)_1, \ldots, (x)_n)$.


Indeed, the set $(x)_1, \ldots, (x)_n$ takes on each $n$-tuple as value for
infinitely many values of $x$. Hence we may exclude any finite number of values
from the range of $x$ in (14) and (15). For example, excluding the value 0,


(16)  $(Ex_1) \cdots (Ex_n)A(x_1, \ldots, x_n) \equiv (Ex)[A((x)_1, \ldots, (x)_n) \& x \neq 0]$,
(17)  $(x_1) \cdots (x_n)A(x_1, \ldots, x_n) \equiv (x)[A((x)_1, \ldots, (x)_n) \lor x = 0]$.
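
The contraction device may be made concrete by computing the prime exponents directly. The sketch below is illustrative only: the names `nth_prime`, `exponent`, `decode` and `encode` are ours, and trial division is chosen for brevity rather than efficiency.

```python
def nth_prime(i):
    """Return the i-th prime, counting 2 as the first (i >= 1)."""
    count, n = 0, 1
    while count < i:
        n += 1
        if all(n % d for d in range(2, int(n ** 0.5) + 1)):
            count += 1
    return n


def exponent(x, i):
    """(x)_i: exponent of the i-th prime in x, and 0 if x == 0 or i == 0."""
    if x == 0 or i == 0:
        return 0
    p, e = nth_prime(i), 0
    while x % p == 0:
        x //= p
        e += 1
    return e


def decode(x, n):
    """The n-tuple ((x)_1, ..., (x)_n) coded by x."""
    return tuple(exponent(x, i) for i in range(1, n + 1))


def encode(values):
    """One nonzero x such that decode(x, len(values)) == tuple(values)."""
    x = 1
    for i, v in enumerate(values, start=1):
        x *= nth_prime(i) ** v
    return x
```

For instance, a pair $x_1, x_2$ with $x_1 + x_2 = 5$ and $x_1 = x_2 + 1$ exists, and correspondingly a single nonzero $x$ with $(x)_1 + (x)_2 = 5$ and $(x)_1 = (x)_2 + 1$ can be found by searching over $x$ alone, as in the contracted form with the value 0 excluded.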


6. Recursive predicates and quantifiers. Consider any predicate ex- 
pressed in terms of general recursive predicates and quantifiers only. The 
expression for the predicate has the form of a recursive predicate with zero 
or more quantifiers prefixed. By the contraction laws (14) and (15), con- 
secutive occurrences of like quantifiers can be eliminated without altering the 
recursive character of the operand of the quantifiers. Hence for a predicate 


of a single variable $a$ only the following normal forms need be considered:

$$
R(a) \qquad
\begin{array}{llll}
(Ex)R(a,x) & (x)(Ey)R(a,x,y) & (Ex)(y)(Ez)R(a,x,y,z) & \cdots \\
(x)R(a,x) & (Ex)(y)R(a,x,y) & (x)(Ey)(z)R(a,x,y,z) & \cdots
\end{array}
$$

where the $R$ for the form is general recursive; and similarly replacing $a$ by
$a_1, \ldots, a_n$ for predicates of $n$ variables $a_1, \ldots, a_n$.
In the classical logic, furthermore, given any expression for an elementary 


6 This $(x)_i$ is the $i$ Gl $x$ of [7] p. 732, which is a modification of the
$i$ Gl $x$ of [4] p. 182.




predicate as described in 2, the quantifiers can be advanced to the front as 
described in 4. By theorems on the composition of recursive predicates with 
recursive functions and the operations of the propositional calculus, the 
operand of the prefixed quantifiers after the advancement constitutes a simple 
recursive predicate.⁷ Hence, classically, the normal forms just described suffice
for the expression of every elementary predicate. 

In this reduction for elementary predicates, the rôle of the logical operations
is minimized at the cost of increasing the complexity of the recursive
predicates. But if the notion of a given constructive predicate is accepted as 
clear, then these reduced forms stand out as particularly clear for the inter- 
pretation of the predicates. 

No further essential reduction in the list of the normal forms is possible, 
by the theorem of the author which says that, classically, to each of the listed 
forms after the first, there is a predicate expressible in that form but neither
in the dual form nor in any of the forms with fewer quantifiers.⁸ The theorem
also has an intuitionistic version. In connection with this theorem, the author
has shown how a number of questions in the foundations of mathematics turn
upon a predicate's being expressible in a certain one or another of these forms.⁹

If a predicate is expressed in one of the forms after the first, then it is
always possible to replace the $R$ by another $R$ which is primitive recursive,
retaining the form.¹⁰


7. Inductive definition. Inductive definition is most familiar in the
case that a class is being defined, but the method applies equally well to 
predicates of more than one variable. The general features of an inductive 
definition are these. First, there are direct clauses. Some of these (basic 
clauses) state that the predicate is true for certain sets of arguments; others 
(inductive clauses) that if it is true for certain sets of arguments, then it is 
true for others related in a certain way to the former. Then there is an 
extremal clause, which states that the predicate is true only for those sets of 
arguments for which its truth is required by the direct clauses. These features 
are illustrated in the examples to be given presently; and our work will be 
with the examples. 

The direct clauses of an inductive definition may be constructive in the 
sense that any proposed particular application of one of them can be recog- 
nized as legal or illegal; or some of them may be non-constructive. When one 
of the clauses requires as premises for its application the truth of the predicate 


7 [4] I-IV.
8 [10] § 5.
9 [10] §§ 11-17.
10 [10] § 9.




for infinitely many sets of arguments, the inductive definition is transfinite. 

In this paper we are concerned with the problem of reducing inductive 
definitions to explicit ones; more specifically, with the problem, to a given 
inductive definition of a predicate, of finding an equivalent elementary one. 
By 6, were such found, then classically we could reduce it to one of the normal 
forms in terms of recursive predicates and quantifiers. However the technique 
which we present below for reducing inductive definitions is an extension of 
that of 6, and when it succeeds we arrive at the latter directly. As it happens, 
for our examples, the reduction requires only intuitionistic methods. 


8. The concept of formal provability. Inductive definitions of meta- 
mathematical predicates occur in the description of a formal deductive system. 
The predicates become number-theoretic ones when a Gödel arithmetization of
metamathematics is applied to the system.¹¹ Let us consider the arithmetized
provability predicate. Suppose that the system has one and two premise rules
of inference. We shall then have given three predicates: $A(a)$: $a$ is an
axiom (i.e., $a$ is the Gödel number of an axiom of the unarithmetized formal
system); $B(a,b)$: $a$ is an immediate consequence of $b$ by a one premise rule
of inference; $C(a,b,c)$: $a$ is an immediate consequence of $b$ and $c$ by a two
premise rule of inference. We shall suppose that the predicates $A$, $B$ and $C$
are general recursive (in effect, that the formal system has constructive rules).
The four clauses which follow constitute the inductive definition of the
predicate $P(a)$: $a$ is provable.

1. If A(a), then P(a). 2. If P(b) and B(a,b), then P(a). 3. If P(b) 
and P(c) and C(a,b,c), then P(a). 4. P(a) only as required by 1-3. 

Rewriting this definition of P(a) in the form of an equivalence, using 


the logical symbolism, 


(18)  $P(a) \equiv A(a) \lor (Ex)[P(x) \& B(a,x)]$
          $\lor (Ex)(Ey)[P(x) \& P(y) \& C(a,x,y)]$.


We now conjecture that $P(a)$ is expressible in the form $(Ex)R(a,x)$ where
$R$ is general recursive; and substitute tentatively the expression "$(Ex)R(a,x)$,"
with "$R$" representing an as yet undetermined predicate, for "$P(a)$" in (18).
Thus, making suitable changes in the bound variables,


(19)  $(Ex)R(a,x) \equiv A(a) \lor (Ex)[(Ez)R(x,z) \& B(a,x)]$
          $\lor (Ex)(Ey)[(Ez)R(x,z) \& (Ez)R(y,z) \& C(a,x,y)]$.


11 [4], [10] § 4.




Advancing quantifiers to the front by (1)-(3), with a change of notation in
the bound variables,


(20)  $(Ex)R(a,x) \equiv (Ex_1)(Ex_2)(Ex_3)(Ex_4)[A(a) \lor [R(x_1,x_2) \& B(a,x_1)]$
          $\lor [R(x_1,x_3) \& R(x_2,x_4) \& C(a,x_1,x_2)]]$.


Thence, contracting by (16),


(21)  $(Ex)R(a,x) \equiv (Ex)\{[A(a) \lor [R((x)_1,(x)_2) \& B(a,(x)_1)]$
          $\lor [R((x)_1,(x)_3) \& R((x)_2,(x)_4) \& C(a,(x)_1,(x)_2)]] \& x \neq 0\}$.


Striking out the prefixed quantifier $(Ex)$ from both members of the last
equivalence,


(22)  $R(a,x) \equiv [A(a) \lor [R((x)_1,(x)_2) \& B(a,(x)_1)]$
          $\lor [R((x)_1,(x)_3) \& R((x)_2,(x)_4) \& C(a,(x)_1,(x)_2)]] \& x \neq 0$.


This equivalence determines $R(a,x)$ as a general recursive predicate, since
for $x \neq 0$, $R(a,x)$ is expressed by it in terms of given general recursive
predicates and functions, the operations of the propositional calculus, and the
predicate $R(s,t)$ itself for only arguments $s,t$ such that $t < x$ by (13), while
for $x = 0$, it makes $R(a,x)$ definitely false. That is, we can take this
equivalence as the definition of $R$, and $R$ will then be general recursive. Then
if we define $P$ from $R$ by

(23)  $P(a) \equiv (Ex)R(a,x)$,


by reversing the steps from (18) to (22) we can prove (18) as a theorem
from (22) and (23) as definitions. But (18) as we know is sufficient to define
$P(a)$ as a predicate. Thus the predicate $P(a)$ defined by (18) is equivalent
to the $P(a)$ defined by (22) and (23). Thus we have proved that the $P(a)$
of (18) is expressible in the form $(Ex)R(a,x)$ where $R$ is general recursive.
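
The course-of-values recursion which defines $R$ can be tried out on a toy system. The sketch below rests on assumptions that are ours alone and appear nowhere in the text: the "axiom" is the number 1, the one premise rule doubles its premise, and the two premise rule adds its premises. The names `exponent`, `R` and `least_witness` are likewise hypothetical. The recursion terminates because each recursive call has a second argument which is a prime exponent of $x$, hence smaller than $x$ when $x \neq 0$.

```python
from functools import lru_cache

PRIMES = (2, 3, 5, 7)  # only (x)_1, ..., (x)_4 occur in the recursion


def exponent(x, i):
    """(x)_i for i in 1..4: exponent of the i-th prime in x (0 if x == 0)."""
    if x == 0 or i == 0:
        return 0
    p, e = PRIMES[i - 1], 0
    while x % p == 0:
        x //= p
        e += 1
    return e


# Hypothetical toy system (an assumption of this sketch).
def A(a):
    return a == 1          # the only axiom is 1


def B(a, b):
    return a == 2 * b      # one premise rule: doubling


def C(a, b, c):
    return a == b + c      # two premise rule: addition


@lru_cache(maxsize=None)
def R(a, x):
    """R(a, x): x codes a verification that a is provable."""
    if x == 0:
        return False       # the value x = 0 is excluded
    x1, x2, x3, x4 = (exponent(x, i) for i in (1, 2, 3, 4))
    return (A(a)
            or (R(x1, x2) and B(a, x1))
            or (R(x1, x3) and R(x2, x4) and C(a, x1, x2)))


def least_witness(a, bound):
    """Least x < bound with R(a, x), or None; the bound is ours."""
    for x in range(1, bound):
        if R(a, x):
            return x
    return None
```

In this toy system the number 2 follows from the axiom 1 by doubling, and the code $2^1 \cdot 3^1 = 6$ records the premise 1 together with a code for the premise. The predicate $P(a)$ itself is an unbounded search over $x$, so `least_witness` with its cutoff is only a finite approximation to it.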

As a matter of fact, by a result of Péter, this $R$ is itself primitive
recursive, if $A$, $B$ and $C$ are such;¹² but in any case, by the remark at the end
of 6, it follows without use of Péter’s result that P(a) is expressible in the 
form (Ex)R(a,x) with an R which is primitive recursive. 

We get another example, if instead of assuming that $A(a)$, $B(a,b)$ and
$C(a,b,c)$ are general recursive, we assume merely that they are expressible
in the respective forms $(Ex)S(a,x)$, $(Ex)T(a,b,x)$ and $(Ex)U(a,b,c,x)$
where $S$, $T$ and $U$ are general recursive. The reduction of $P(a)$ to the form


12 [13]. The recursion equations for the representing function of $R$ have the form
of a double recursion, p. 493, with no nesting of $\phi$'s, and so by pp. 508-9 the
function is primitive recursive.




$(Ex)R(a,x)$ where $R$ is general recursive will go through with this alteration
in the definition of $P(a)$, since again only existential quantifiers come to the
front. This means that the form of the concept of formal provability is not
altered by allowing a non-constructiveness in the rules of the formal system
to the exact extent which occurs in the provability concept for a system with
constructive rules.¹³


9. General aspects of the problem of reducing inductive definitions 
to explicit definitions. The two examples of 8 illustrate the treatment of 
inductive definitions with constructive direct clauses. In fact, when the notion 
of all predicates definable by successive inductive definitions with constructive 
direct clauses is made precise in a certain quite natural manner, then the 
method of these examples can be used to prove the general theorem that all 
such predicates are expressible in the form $(Ex)R(a_1, \ldots, a_n, x)$ where $R$ is
general recursive.¹⁴

Conversely, every predicate expressible in this form can be introduced 
by a series of inductive definitions with constructive direct clauses. Such a 
series is obtainable by the process of arithmetizing the metamathematical 
definitions for a suitable formal system which is consistent and complete 
for the proof of formulas expressing the true values of the predicate
$(Ex)R(a_1, \ldots, a_n, x)$.

In the principal example of this paper, which follows, we accomplish the 
reduction of a certain particular inductive definition with non-constructive 
direct clauses.

However, our technique does not afford a general solution of the problem 
of reducing inductive definitions with non-constructive direct clauses to ex- 
plicit ones, but is merely one which heuristically may lead to a reduction in 
particular cases. To begin with, the technique cannot always succeed in 
accomplishing reduction, since it is possible by inductive definition with non- 
constructive direct clauses to define a predicate which classically is
non-elementary.¹⁵ Moreover, the failure by the technique to accomplish the
reduction of a given inductive definition would not in itself establish the
non-elementary character of the predicate. No technique can exist which
would afford a general solution of the problem in this sense.¹⁶


13 For another treatment of this, see [10] § 14. 
14 This is contained in a manuscript of the author's which is not yet in form for
publication.
15 The definition of $M(a,k)$ in [10] § 17 can be written as an inductive definition.
16 Using the predicate $T_1(z,x,y)$ from [10] § 4 or that from [7] § 2, we can
introduce a parameter $e$ into the inductive definition of $M(a,k)$ so that the resulting



The foregoing remarks on the non-constructive case are not precise, since 
we have not given an exact general description of inductive definition; but 
such a description can be given in a natural manner for the purpose of the 
present discussion. In particular, it would be required that the predicates 
presupposed in the statement of the clauses should be either elementary or 
defined by previous inductive definitions of the same type. 

While the intuitionistic methods suffice for our example, it appears quite 
possible that the reduction technique might succeed for some examples classi- 
cally but not intuitionistically. We know that the reduction of elementary 
predicates to the normal forms which was described in 6 is not always valid 
intuitionistically; and no reason is seen why there may not be inductively 
defined predicates reducible to the normal forms classically but not even
elementary from the intuitionistic standpoint.¹⁷


10. Recursive definition of a function by a number. We shall give a 
minimum of preliminaries to the inductive definition of our principal example, 
confining ourselves chiefly to items relevant to the reduction. 

We use, from the theory of general recursive functions,¹⁸ a certain primitive
recursive predicate $T_1(z,x,y)$ and a certain primitive recursive function
$U(y)$. We shall write here simply "$T$" for "$T_1$."

Where $A(x,y)$ is any number-theoretic predicate, we use $\mu y A(x,y)$ to
denote the least $y$ such that $A(x,y)$ is true. Thus $\mu y A(x,y)$ is a function
of the remaining variable $x$, defined for those natural numbers as values of $x$
for which $(Ey)A(x,y)$ is true, and undefined for other values of $x$.

At this point we are liberalizing our use of the term function to allow 
partial number-theoretic functions which are not assumed to be defined for 
all arguments in the domain of natural numbers. The partial functions, as 
we use the term, include the ordinary or complete functions. 


predicate $M_e(a,k)$ has the properties
    $[(x)(x \leq k \rightarrow \overline{T_1(e,e,x)})] \rightarrow [M_e(a,k) \equiv M(a,k)]$
and
    $[(Ex)(x \leq k \& T_1(e,e,x))] \rightarrow [M_e(a,k+1) \equiv M_e(a,k)]$.

Then if $(Ex)T_1(e,e,x)$, the predicate $M_e(a,k)$ is elementary; while if
$(x)\overline{T_1(e,e,x)}$, the predicate $M_e(a,k)$ is equivalent to $M(a,k)$ and
hence classically non-elementary. By [10] § 5 or [7] § 2, the predicate
$(Ex)T_1(a,a,x)$ is non-recursive. Hence by [1] § 7 or [10] § 12, there can be no
general technique for determining whether a given number $e$ has the property
$(Ex)T_1(e,e,x)$, and therefore whether the predicate $M_e(a,k)$ is elementary.

17 These remarks on the intuitionistic case are conjectures, which perhaps the
theory being developed in [11] and [12] may provide the means for resolving. 

18 [10] §§ 4, 7. 




In writing equality relationships between partial functions, we use the
symbol $\simeq$ rather than $=$ to indicate the possibility of indefinition of the
values. Thus $\phi(x) \simeq \psi(x)$, read, according to the context, either as
referring to a particular $x$ or as referring to any $x$, means that, for the $x$
considered, if $\phi(x)$ and $\psi(x)$ are both defined, they have the same value,
and if either is undefined, the other is. For example, with the symbol $\simeq$,
the equality $\phi(x) \simeq \phi(x) + 1$ is not contradictory, if $\phi(x)$ is that
partial function of $x$ which is undefined for every value of $x$.
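
Partial functions, the least-number operator and the weak equality just described may be imitated in a program by letting `None` stand for "undefined". This is a sketch under that assumption; the names `mu`, `weakly_equal`, `nowhere` and `nowhere_plus_one` are ours, and the search cutoff is a convenience of the sketch, since the genuine operator searches without bound and need not terminate.

```python
def mu(pred, x, bound=10_000):
    """Least y < bound with pred(x, y), or None if none is found.

    The cutoff `bound` is an assumption of this sketch: a value of None
    only means that no y was found below it.
    """
    for y in range(bound):
        if pred(x, y):
            return y
    return None


def weakly_equal(f, g, x):
    """Weak equality at x: both undefined, or both defined with equal values."""
    return f(x) == g(x)


def nowhere(x):
    """The partial function which is undefined for every x."""
    return None


def nowhere_plus_one(x):
    """nowhere(x) + 1, which is undefined wherever nowhere(x) is."""
    value = nowhere(x)
    return None if value is None else value + 1
```

With the hypothetical predicate `lambda x, y: y * y == x`, the call `mu(..., 49)` yields a value while `mu(..., 50)` yields `None`, mirroring a function defined only for perfect squares. The pair `nowhere`, `nowhere_plus_one` mirrors the example of an equality with an added 1 which is not contradictory.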

The preceding, and following, remarks are written out for functions of 
one variable, but apply analogously to functions of n variables, for any positive 
integer n. 

An important partial function $\Phi_1(z,x)$ of two variables, which we shall
here write omitting the subscript "1," is defined thus,


(24)  $\Phi(z,x) \simeq U(\mu y T(z,x,y))$.


Let $\phi(x)$ be a partial function of $x$. We say that the natural number $e$
defines $\phi$ recursively, if
(25)  $\phi(x) \simeq \Phi(e,x)$.


A notion of partial recursive function is obtained by extending the notion 
of general recursive definition to partial functions retaining the characteristic 
feature of the former as applied now to the set of the arguments for which a 
partial function is defined. 

According to a fundamental theorem in the theory of general and partial
recursive functions,¹⁹ if $\phi(x)$ is general or partial recursive, then (25) holds
for some natural number $e$.

Conversely, for any $e$, the function $\phi(x)$ defined by (25) is partial
recursive, and therefore in the case that it is completely defined general
recursive. This follows from the result that for any general recursive $A(x,y)$,
the function $\mu y A(x,y)$ is partial recursive.²⁰

Note that, if $z$ is a fixed number, the condition that $\Phi(z,x)$ be defined
for a given value of $x$ as argument is as follows,


(26)  $\{\Phi(z,x) \text{ is defined}\} \equiv (Ey)T(z,x,y)$.


The predicate $T(z,x,y)$ is so chosen, in the most recent version of the
theory, that for a fixed $z$ and $x$, $T(z,x,y)$ is true for at most one $y$. Hence,


(27)  $T(z,x,y) \rightarrow \Phi(z,x) = U(y)$.


19 [7] IV, [10] § 7.
20 [7] V, [10] §§ 3, 6.




In stating this theory for functions of $n$ variables, we start with a
$T_n(z, x_1, \ldots, x_n, y)$ and define from it a $\Phi_n(z, x_1, \ldots, x_n)$.

There is a certain primitive recursive function $S_1^1(z,y)$ with the property
that, if $e$ defines recursively a partial function $\phi(a,x)$ of two variables
$a$ and $x$, then, for each fixed $a$, $S_1^1(e,a)$ defines recursively $\phi(a,x)$
considered as function of the one remaining variable $x$. Similarly there is a
$S_n^m(z, y_1, \ldots, y_m)$ for any $m$ parameters and $n$ remaining variables.²¹


11. Ordinal representation. By $n_O$ we shall denote the primitive recursive
function defined thus: $0_O = 1$, $(n+1)_O = 2^{n_O}$.

Let $n$ range over the natural numbers, and $y_n$ be a number depending
on $n$. By saying that a number $y$ defines $y_n$ recursively as function of $n_O$,
we mean that there is a function $\phi(x)$ which takes as values $y_0, y_1, y_2, \ldots$
when $x$ takes as values $0_O, 1_O, 2_O, \ldots$, respectively, and which is defined
recursively by $y$. What that $\phi(x)$ may be for other values of $x$, i.e., whether
or not defined and with what values if defined, is immaterial.
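
The numbers which serve as arguments here are iterated powers of 2, beginning with 1. A small sketch, in which the names `numeral` and `finite_ordinal` are ours, computes them and recognizes them:

```python
def numeral(n):
    """The n-th iterated power of 2: numeral(0) == 1, numeral(n + 1) == 2 ** numeral(n)."""
    value = 1
    for _ in range(n):
        value = 2 ** value
    return value


def finite_ordinal(a):
    """Return n if a == numeral(n) for some n, else None."""
    n = 0
    while a != 1:
        # a must be an exact power of two, 2 ** y with y >= 1
        if a < 2 or a & (a - 1):
            return None
        a = a.bit_length() - 1  # the y with a == 2 ** y
        n += 1
    return n
```

The values grow very quickly (1, 2, 4, 16, 65536, and then a number of 65537 binary digits), so a function of these arguments is sampled only at very sparse points.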

The class $O$ and relation $<_O$ are now defined by the following simultaneous
transfinite induction.

O1. $1 \in O$. O2. If $y \in O$, then $2^y \in O$ and $y <_O 2^y$. O3. If, for each $n$,
$y_n \in O$ and $y_n <_O y_{n+1}$, and if $y$ defines $y_n$ recursively as function of
$n_O$, then $3 \cdot 5^y \in O$ and, for each $n$, $y_n <_O 3 \cdot 5^y$. O4. If $x \in O$,
$y \in O$, $z \in O$, $x <_O y$ and $y <_O z$, then $x <_O z$. O5. $a \in O$ and $a <_O b$
only as required by O1-O4.

Briefly, the rôle of the class $O$ and relation $<_O$ in the theory of
constructive ordinals is this.²² The natural numbers which $\in O$ are mapped
many-one on the ordinal numbers of a segment of the Cantor first and second
number classes. The relation <o partially orders the former, and becomes 
the simple ordering relation of the latter under the mapping. The segment 
of ordinals on which the mapping takes place constitutes the so-called con- 
structive first and second number classes. 

The proof of our principal result we shall give in three main parts, which 
occupy the next three sections, and which will be assembled to give the 
theorem in 15. 


12. Analysis of $O$ and $<_O$ in terms of $C(b)$ and $Q$. The transitivity
clause O4 creates a difficulty for direct reduction of the definition of $O$. We
shall get around this by introducing another relation $a \in C(b)$ to take the
place of $a <_O b$. For each $b$ which $\in O$, $C(b)$ will be the class of all numbers
$a$ such that $a <_O b$. With this $C(b)$ we shall define a $Q$ equivalent to the $O$.


21 [9] p. 153.
22 [3], [2], [9].




The inductive definition of C(b) is as follows. 


C1. If $y \neq 0$, $C(2^y) = \{y\} + C(y)$. C2. $C(3 \cdot 5^y) = \sum_{n=0}^{\infty}
[\{\Phi(y, n_O)\} + C(\Phi(y, n_O))]$, where terms of the sum for which
$\Phi(y, n_O)$ is undefined contribute no elements to $C(3 \cdot 5^y)$. C3. $a \in C(b)$
only as required by C1-C2.
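
For numbers whose analysis involves only powers of 2, the class of predecessors can be computed outright. The following is a sketch restricted to that case, on the reading that the power-of-2 clause applies to a positive exponent and contributes the exponent together with its own predecessors. The names `limit_exponent` and `predecessors` are ours, and the limit clause is deliberately not implemented, since it depends on the function defined recursively by the exponent.

```python
def limit_exponent(b):
    """Return y if b == 3 * 5 ** y, else None."""
    if b <= 0 or b % 3:
        return None
    b //= 3
    y = 0
    while b % 5 == 0:
        b //= 5
        y += 1
    return y if b == 1 else None


def predecessors(b):
    """The class of predecessors of b, for b not requiring the limit clause."""
    if b >= 2 and (b & (b - 1)) == 0:   # b == 2 ** y with y >= 1
        y = b.bit_length() - 1
        return {y} | predecessors(y)
    if limit_exponent(b) is not None:
        # Would need the values of a partial recursive function; out of scope here.
        raise NotImplementedError("limit clause is not covered by this sketch")
    return set()                        # no clause applies: the class is vacuous
```

For example, `predecessors(16)` collects 4, 2 and 1, the numbers obtained by repeatedly passing from a power of 2 to its exponent, while `predecessors(1)` is empty.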

When a predicate has been defined by induction, we may use proof by 
induction, in a form which corresponds to the form of the definition by 
induction, to establish properties of the predicate. This method is to be used 
in establishing the following lemmas. The proof of (VII) is given in detail
for illustration.


LEMMAS. (I) If $a \in O$, then $a \neq 0$. (II) If $a <_O b$, then $b \neq 1$. (Use
(I).) (III) $C(1)$ is vacuous. (IV) If $a <_O b$, then $a \in O$ and $b \in O$. (V) If
$x <_O 2^y$, then either $x = y$ or $x <_O y$. (Use (IV).) (VI) If $a <_O 3 \cdot 5^y$,
then for some $n$, either $a = y_n$ or $a <_O y_n$, where $y_n = \Phi(y, n_O)$. (Use (IV).)

(VII) If $b \in O$, then $a \in C(b) \rightarrow a <_O b$. Proof is by mathematical
induction, in a form corresponding to the definition of $O$ by induction, as
follows. The parts of the proof numbered 1, 2, 3 correspond to the clauses O1,
O2, O3, respectively, of the definition.

1. By (III), $a \in C(1)$ cannot hold, and hence vacuously, $a \in C(1)
\rightarrow a <_O 1$.

2. Assume that $y \in O$, and (as hypothesis of the induction) that $a \in C(y)
\rightarrow a <_O y$. By O2, then $2^y \in O$ and $y <_O 2^y$. Assume (as hypothesis of
the implication to be proved) that $a \in C(2^y)$. By C1, either $a = y$ or
$a \in C(y)$. If $a = y$, then $a <_O 2^y$. If $a \in C(y)$, then by the hypothesis of
the induction, $a <_O y$, and hence $a <_O 2^y$ by (IV) and O4. Thus in both cases,
$a <_O 2^y$. This was under the assumption $a \in C(2^y)$. Therefore,
$a \in C(2^y) \rightarrow a <_O 2^y$.

3. Assume that, for every $n$, $y_n \in O$ and $y_n <_O y_{n+1}$, that $y$ defines $y_n$
recursively as function of $n_O$, and (as hypothesis of the induction) that, for
every $n$, $a \in C(y_n) \rightarrow a <_O y_n$. By O3, $3 \cdot 5^y \in O$ and, for every $n$,
$y_n <_O 3 \cdot 5^y$. Assume (as hypothesis of the implication to be proved) that
$a \in C(3 \cdot 5^y)$. By C2, for some $n$, either $a = \Phi(y, n_O)$ or
$a \in C(\Phi(y, n_O))$. Since $y_n = \Phi(y, n_O)$, we have either $a = y_n$ or
$a \in C(y_n)$. If $a = y_n$, then $a <_O 3 \cdot 5^y$. If $a \in C(y_n)$, then by the
hypothesis of the induction, $a <_O y_n$, and by (IV) and O4, $a <_O 3 \cdot 5^y$. Thus
in both cases, $a <_O 3 \cdot 5^y$. Therefore, $a \in C(3 \cdot 5^y) \rightarrow
a <_O 3 \cdot 5^y$.

The proof by induction is now completed. 

(VIII) If $b \in O$, then $a <_O b \rightarrow a \in C(b)$. Proof is similar, using (I),
(II), (IV), (V) and (VI).




The inductive definition of Q is as follows. 

Q1. $1 \in Q$. Q2. If $y \in Q$, then $2^y \in Q$. Q3. If, for each $n$, $y_n \in Q$ and
$y_n \in C(y_{n+1})$, and if $y$ defines $y_n$ recursively as a function of $n_O$, then
$3 \cdot 5^y \in Q$. Q4. $a \in Q$ only as required by Q1-Q3.

We can now establish, using (IV), (VII) and (VIII),


(28)  $a \in O \equiv a \in Q$,
(29)  $a <_O b \equiv b \in Q \& a \in C(b)$.


13. Reduction of $a \in C(b)$. Rewriting the inductive definition of $C(b)$
in symbols as an equivalence,


(30)  $a \in C(b) \equiv (Ey)\{y \neq 0 \& b = 2^y \& (a = y \lor a \in C(y))\}$
          $\lor (Ey)\{b = 3 \cdot 5^y \& (En)[(\Phi(y, n_O) \text{ is defined}) \& (a = \Phi(y, n_O)$
          $\lor a \in C(\Phi(y, n_O)))]\}$.


Thence, using (26) and (27),


(31)  $a \in C(b) \equiv (Ey)\{y \neq 0 \& b = 2^y \& (a = y \lor a \in C(y))\}$
          $\lor (Ey)\{b = 3 \cdot 5^y \& (En)(Ez)[T(y, n_O, z) \& (a = U(z)$
          $\lor a \in C(U(z)))]\}$.


Substituting "$(Ex)V(a,b,x)$" for "$a \in C(b)$," all the existential quantifiers
on the right come to the front, and so by the method of 8 we determine a
general recursive (and in fact, primitive recursive) predicate $V(a,b,x)$ such
that

(32)  $a \in C(b) \equiv (Ex)V(a,b,x)$.


14. Reduction of $a \in Q$. If the natural number $a$ is of the form $2^y$,
then $y = (a)_1$; if of the form $3 \cdot 5^y$, then $y = (a)_3$. Rewriting the
definition of $Q$, with the use of these expressions for $y$,


(33)  $a \in Q \equiv a = 1 \lor \{a = 2^{(a)_1} \& (a)_1 \in Q\} \lor \{a = 3 \cdot 5^{(a)_3}$
          $\& (n)[\Phi((a)_3, n_O) \text{ is defined}]$
          $\& (n)[\Phi((a)_3, n_O) \in Q]$
          $\& (n)[\Phi((a)_3, n_O) \in C(\Phi((a)_3, (n+1)_O))]\}$.


Using (26), (27) and (32),


(34)  $a \in Q \equiv a = 1 \lor \{a = 2^{(a)_1} \& (a)_1 \in Q\} \lor \{a = 3 \cdot 5^{(a)_3}$
          $\& (n)(Ey)T((a)_3, n_O, y)$
          $\& (n)(y)[T((a)_3, n_O, y) \rightarrow U(y) \in Q]$
          $\& (n)(y)(z)[(T((a)_3, n_O, y) \& T((a)_3, (n+1)_O, z))$
          $\rightarrow (Ex)V(U(y), U(z), x)]\}$.




Substituting "$(x)(Ey)R(a,x,y)$" for "$a \in Q$," with suitable changes in the
bound variables,


(35)  $(x)(Ey)R(a,x,y) \equiv a = 1 \lor \{a = 2^{(a)_1} \& (x_1)(Ey_1)R((a)_1, x_1, y_1)\}$
          $\lor \{a = 3 \cdot 5^{(a)_3}$
          $\& (x_2)(Ey_1)T((a)_3, (x_2)_O, y_1)$
          $\& (x_2)(x_3)[T((a)_3, (x_2)_O, x_3) \rightarrow (x_4)(Ey_2)R(U(x_3), x_4, y_2)]$
          $\& (x_2)(x_3)(x_4)[(T((a)_3, (x_2)_O, x_3) \& T((a)_3, (x_2+1)_O, x_4))$
          $\rightarrow (Ey_3)V(U(x_3), U(x_4), y_3)]\}$.


The determination of $R(a,x,y)$ as a general recursive (and in fact, primitive
recursive) predicate such that

(36)  $a \in Q \equiv (x)(Ey)R(a,x,y)$


can now be accomplished by the technique of 8; in so doing, the following 
particulars may be noted. 

The distribution of subscripts on the bound variables in (35) is such that,
when the advancement of quantifiers on the right is carried through in a
suitable way, they will come to the front in the order


$(x_1)(x_2)(x_3)(x_4)(Ey_1)(Ey_2)(Ey_3)$.


To see that this can be done intuitionistically, note the following. The first
members of the two implications are recursive, so the special condition for
the intuitionistic use of (10*) and (11*) is fulfilled. In the advancement of
universal quantifiers across the second disjunction, (12*) will have to be used.
If $(x_1)$ is brought first to the front of the middle disjunctive member, the
special conditions for the application of (12*) to advance it across the
disjunction are realized, since $a = 3 \cdot 5^{(a)_3}$ is recursive and excludes the
part $a = 2^{(a)_1}$, and hence the whole, of the operand of $(x_1)$. If next $(x_2)$,
$(x_3)$ and $(x_4)$ are brought one at a time to the front of the last disjunctive
member, the special conditions for the advancement of each across the
disjunction are realized, since $a = 2^{(a)_1}$ is recursive and excludes the part
$a = 3 \cdot 5^{(a)_3}$, and hence the whole, of the operand of the quantifier. For the
advancement of the universal quantifiers across the first disjunction, (7*)
suffices, since $a = 1$ is recursive.

In the subsequent contraction, we can use (16) and (17), or one of these 
and the other of (14) and (15). 


15. The form of the predicates $a \in O$ and $a <_O b$. By (28) and (36),
(37)  $a \in O \equiv (x)(Ey)R(a,x,y)$.
By (29), (32) and (36),




(38)  $a <_O b \equiv (x)(Ey)R(b,x,y) \& (Ex)V(a,b,x)$.


Advancing the quantifiers and contracting, if we set
$S(a,b,x,y) \equiv R(b,x,(y)_1) \& V(a,b,(y)_2)$, then

(39)  $a <_O b \equiv (x)(Ey)S(a,b,x,y)$

where $S(a,b,x,y)$ is primitive recursive. Thus we establish


THEOREM 1. The predicates $a \in O$ and $a <_O b$ are expressible in the
respective forms $(x)(Ey)R(a,x,y)$ and $(x)(Ey)S(a,b,x,y)$ where $R$ and $S$
are primitive recursive.


The crux of the foregoing reduction is to make all the universal quantifiers 
in the right member of (35) come to the front ahead of the existential. In 
general, the problem in applying the reduction technique is to make the 
quantifiers come to the front in such an order that after contraction they will 
be the same sequence of quantifiers as in the form substituted for the predicate, 
for some choice of that form. An attempt to reduce the predicate $a \in O$ of
the system $S_1$ of notation for ordinals,²³ which belongs to an earlier version
of the Church-Kleene theory, failed on this point.


16. Recursive mappings of the classes $\hat{a}(a \in O)$ and $\hat{a}(x)(Ey)T(a,x,y)$,
each on a part of the other. Using the $R(a,x,y)$ of the preceding theorem,
we introduce the partial function $\phi(a,x) \simeq \mu y R(a,x,y)$. This is partial
recursive, by the result on the $\mu$-operator cited in 10. Then, for each fixed $a$,
we introduce $\phi_a(x) \simeq \phi(a,x)$ considered as function of the remaining
variable $x$. We let $e$ be a number defining $\phi(a,x)$ recursively, and set
$F(a) = S_1^1(e,a)$. Then $F(a)$ is primitive recursive, and for each fixed $a$,
$F(a)$ defines $\phi_a(x)$ recursively. For the fixed $a$, the partial recursive
function $\phi_a(x)$ is completely defined, and therefore general recursive, if and
only if $(x)(Ey)R(a,x,y)$. Also, by (25) and (26), the condition that $F(a)$ define
a general recursive function is $(x)(Ey)T(F(a),x,y)$. Therefore,


(40)  $(x)(Ey)R(a,x,y) \equiv (x)(Ey)T(F(a),x,y)$.
Restating this with the use of (37),
(41)  $a \in O \equiv (x)(Ey)T(F(a),x,y)$.


By the construction of $F(a)$ which actually underlies this discussion,


(42)  $a = b \equiv F(a) = F(b)$.


23 [9] p. 153.




We now have the first part of the next theorem. 

To establish the inverse relationship, let $\psi(a,x) \simeq (\Phi(a,0) + \Phi(a,1)
+ \cdots + \Phi(a,x) + x)_O$. This function is partial recursive. For a fixed $a$,
we introduce $\psi_a(x) \simeq \psi(a,x)$ considered as function of $x$ only. Let $f$ be
a number defining $\psi(a,x)$ recursively, and set $G(a) = 3 \cdot 5^{S_1^1(f,a)}$.

For any given $a$, let $\phi(x)$ be the partial function defined recursively by
the number $a$. If this function is general recursive, then the values of
$\psi_a(x)$ for $x = 0_O, 1_O, 2_O, \ldots$ are $(y_0)_O, (y_1)_O, (y_2)_O, \ldots$,
respectively, where $y_0, y_1, y_2, \ldots$ are successively increasing natural
numbers; and so by O3, $G(a) \in O$. In fact, $G(a)$ then represents the ordinal
$\omega$. Conversely, if $G(a) \in O$, then by the definition of $O$, the values of
$\psi_a(x)$ for $x = 0_O, 1_O, 2_O, \ldots$ must all be defined, which can only be the
case if $\phi(x)$ is completely defined. Therefore, using (25) and (26),


(43)  $(x)(Ey)T(a,x,y) \equiv G(a) \in O$.


By the construction of $G(a)$,


(44)  $a = b \equiv G(a) = G(b)$.


THEOREM 2. The set $O$ of the numbers which represent constructive
ordinals is mapped one-to-one by a primitive recursive function $F(a)$ on a
subset of the set of the numbers which define general recursive functions
recursively; and inversely, the latter set is mapped one-to-one by a primitive
recursive function $G(a)$ on a subset of the former.

This is suggestive of the Cantor continuum hypothesis,²⁴ but the analogy
is deficient, since under the inverse mapping all the images represent the
single ordinal $\omega$.

In the proof of (40), the only property of $R(a,x,y)$ which we used was
its general recursiveness. Therefore by the same method we can set up a
primitive recursive function $H(a)$ such that


(45)  $(x)(Ey)T_2(a,a,x,y) \equiv (x)(Ey)T(H(a),x,y)$.


17. Specific character of the reduction given for $a \in O$ and $a <_O b$.
It is known that the predicate $(x)(Ey)T_2(a,a,x,y)$ is not expressible in the
form $(Ex)(y)M(a,x,y)$ where $M$ is general recursive, and a fortiori not in
any of the normal forms of 6 with fewer quantifiers.²⁵ From this fact, we


24 A constructive analog of the continuum hypothesis which is false is given in
[15] § 10.
25 [10] § 5.




shall infer the like successively for $(x)(Ey)T(a,x,y)$, $a \in O$, and, with $b$ as
additional variable, $a <_O b$.
Suppose we did have


(a)  $(x)(Ey)T(a,x,y) \equiv (Ex)(y)N(a,x,y)$
with a recursive $N$. Then by (45) we should have
(b)  $(x)(Ey)T_2(a,a,x,y) \equiv (Ex)(y)N(H(a),x,y)$.


Since then $N(H(a),x,y)$ would be a recursive $M(a,x,y)$, this is impossible.
Hence (a) is impossible.²⁶
Then likewise, if we had


(c)  $a \in O \equiv (Ex)(y)P(a,x,y)$


with a recursive $P$, using (43), $P(G(a),x,y)$ would be a recursive $N(a,x,y)$;
and so (c) is impossible.
Finally, suppose that we had


(d)  $a <_O b \equiv (Ex)(y)Q(a,b,x,y)$
with a recursive $Q$. Then we should have by substitution,
(e)  $a <_O 2^a \equiv (Ex)(y)Q(a,2^a,x,y)$.

By O2 and (IV),

(46)  $a <_O 2^a \equiv a \in O$.

From (e), we should have by (46),

(f)  $a \in O \equiv (Ex)(y)Q(a,2^a,x,y)$.


Since then $Q(a,2^a,x,y)$ would be a recursive $P(a,x,y)$, this is impossible.
Hence (d) is impossible.


THEOREM 3. The predicates $a \in O$ and $a <_O b$ are not expressible in the
forms dual to those of Theorem 1, nor in any of the forms with fewer
quantifiers.


AMHERST COLLEGE, 
AMHERST, MASS.


26 This supplies the proof of [7] XI from [10] § 5 referred to in [10] § 4
footnote (9).




BIBLIOGRAPHY. 


CHURCH, ALONZO. [1] "An unsolvable problem of elementary number theory,"
American Journal of Mathematics, vol. 58 (1936), pp. 345-363. [2] "The constructive
second number class," Bulletin of the American Mathematical Society, vol. 44 (1938),
pp. 224-232.

CHURCH, ALONZO and KLEENE, S. C. [3] "Formal definitions in the theory of
ordinal numbers," Fundamenta Mathematicae, vol. 28 (1936), pp. 11-21.

GÖDEL, KURT. [4] "Über formal unentscheidbare Sätze der Principia Mathematica
und verwandter Systeme I," Monatshefte für Mathematik und Physik, vol. 38
(1931), pp. 173-198. [5] "On undecidable propositions of formal mathematical
systems," mimeographed notes on lectures at the Institute for Advanced Study (1934).

HILBERT, DAVID and BERNAYS, PAUL. [6] Grundlagen der Mathematik, vol. 1
(1934), Berlin (Springer).

KLEENE, S. C. [7] "General recursive functions of natural numbers,"
Mathematische Annalen, vol. 112 (1936), pp. 727-742. [8] "A note on recursive functions,"
Bulletin of the American Mathematical Society, vol. 42 (1936), pp. 544-546. [9] "On
notation for ordinal numbers," Journal of Symbolic Logic, vol. 3 (1938), pp. 150-155.
[10] "Recursive predicates and quantifiers," Transactions of the American
Mathematical Society, vol. 53 (1943), pp. 41-73. [11] "On the interpretation of intuitionistic
number theory," Bulletin of the American Mathematical Society, vol. 48 (1942),
abstract 85, p. 51.

NELSON, DAVID. [12] "Recursive functions and intuitionistic number theory,"
under preparation.

PÉTER, RÓZSA. [13] "Über die mehrfache Rekursion," Mathematische Annalen,
vol. 113 (1936), pp. 489-527.

TURING, A. M. [14] "On computable numbers, with an application to the
Entscheidungsproblem," Proceedings of the London Mathematical Society, ser. 2, vol. 42
(1937), pp. 230-265. [15] "Systems of logic based on ordinals," ibid., vol. 45 (1939),
pp. 161-228.


THE RESULTANT OF A LINEAR SET.* 


By Ernst SNAPPER. 


Introduction. Hentzelt and Noether [1] have discussed the resultant of 
an ideal consisting of polynomials in n variables. In the present paper the 
corresponding theory for a linear subset of an m-dimensional vector space 
whose scalar domain consists of polynomials in n variables is developed. The 
results of [1] are shown to hold equally well for any number of dimensions, 
a step toward a theory of matrices whose elements are such polynomials. 

Assuming that the linear sets have been transformed by means of a linear
transformation (see 2), the main results are: With every linear set L of such
a vector space, a resultant $\rho$ is associated which is a polynomial in n variables.
This resultant vanishes for, and only for, the common roots of the polynomials
of the ideal L/Cl(L). (See 6.) Furthermore, if $L_1 \supseteq L_2$, then $L_1 = L_2$
if and only if $L_1$ and $L_2$ have the same rank and resultant.

This demonstrates the importance of the algebraic manifold of the ideal
L/Cl(L) for the linear set L. The resultant $\rho$ can be factored into exactly
n factors $\rho = \prod_{i=1}^{n} \rho^{(i)}$ such that every common root of the polynomials of
L/Cl(L) corresponds to one of these factors. Furthermore, the multiplicities
of the irreducible factors of $\rho^{(i)}$ (the multiplicity being the product of degree
and exponent) are equal to the number of independent restclasses of certain
factor groups which are uniquely determined by L. The last statement of
the results gives a criterion for the existence of a polynomial solution of a
system of linear equations with polynomial coefficients. (See Theorem 5.3.)
This is the analogue of the theorem (see [2]) that in an algebraic number
system, a system of linear equations can be solved simultaneously if and only
if the matrix of the system and the "augmented" matrix have the same rank
and the same highest dimensional determinantal factor.

Since the methods of Hentzelt and Noether [1] are immediately applicable 
to vector spaces, direct reference is made to their proofs and additional details 
are given only where necessary. 

Notation. Scalars are indicated by lower case Greek letters; linear sets, 
matrices and integral domains by capital Latin letters; vectors by lower case 
Latin letters and ideals by lower case German letters. 


* Received October 21, 1942. 




1. The linear subset of a general vector space. Let $V_m$ be an m-
dimensional vector space, consisting of row vectors with m components where
the components form an integral domain S, called the scalar domain of the
vector space. A linear subset L of $V_m$ is a subset closed under vector sub-
traction and scalar multiplication. The quotient $L_1/L_2$ of two linear sets, the
quotient $L/\mathfrak{a}$ of a linear set by an ideal, the product $\mathfrak{a}L$ of an ideal and a
linear set, the closure Cl(L) of a linear set and the notion of a closed set have
been previously defined in [3]. We also use these definitions for denumerably
infinite dimensional vector spaces, whose vectors have a denumerably infinite
number of components of which only a finite number are different from zero.
Returning to a finite number of dimensions m, the i-th determinantal factor
$\mathfrak{d}_i$ of a linear set L (or of a matrix A with m columns) is the ideal generated
by the minors of dimension $m - i$ of the matrices whose rows are vectors of L
(or just by the minors of dimension $m - i$ of A) for $i = 0, \dots, m - 1$. For
$i \ge m$ we define $\mathfrak{d}_i = S$ as is done by Fitting [4]. The j-th invariant factor
of L is, for $j = 1, 2, \dots$, defined as $\mathfrak{e}_j = \mathfrak{d}_{k+j-1}/\mathfrak{d}_{k+j}$, where $\mathfrak{d}_k$ is the first non-
zero determinantal factor of L. Hence, a linear set or a matrix has an in-
finite number of determinantal factors and invariant factors, and $\mathfrak{d}_{i-1} \subseteq \mathfrak{d}_i$.
Furthermore, if the rows of the matrix A form a set of generators of L, the
i-th determinantal and hence invariant factors of L are equal to those of A.
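As an illustration only, the definition of the determinantal factors can be tried out on the simplest scalar domain, the integers, where every ideal is principal and the i-th determinantal factor of a matrix with m columns is generated by the greatest common divisor of its minors of dimension m - i. The matrix `A` and the helper names `det` and `determinantal_generator` below are assumptions made for this sketch; they do not come from the paper, which works with polynomial scalar domains.

```python
from itertools import combinations
from math import gcd


def det(rows):
    """Integer determinant by Laplace expansion along the first row."""
    if not rows:
        return 1
    total = 0
    for j, entry in enumerate(rows[0]):
        if entry:
            minor = [r[:j] + r[j + 1:] for r in rows[1:]]
            total += (-1) ** j * entry * det(minor)
    return total


def determinantal_generator(a, m, i):
    """Generator of the i-th determinantal factor of a over the integers:
    the gcd of all minors of dimension m - i (0 if they all vanish)."""
    k = m - i
    if k <= 0:
        return 1  # for i >= m the factor is the whole scalar domain
    g = 0
    for rs in combinations(range(len(a)), k):
        for cs in combinations(range(m), k):
            g = gcd(g, det([[a[r][c] for c in cs] for r in rs]))
    return g


# Hypothetical example: three generators in Z^3, the third being the sum
# of the first two, so the rank is 2.
A = [[2, 4, 6],
     [0, 6, 12],
     [2, 10, 18]]
m = 3
deltas = [determinantal_generator(A, m, i) for i in range(m + 2)]
print(deltas)  # [0, 12, 2, 1, 1]
```

Here the first non-zero generator is the one with index m - r = 1, in agreement with rank r = 2, and each generator divides the preceding one, which is the inclusion of each determinantal factor in the next.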

Let us now assume that every ideal of the scalar domain S has a finite
ideal basis. The rank r of a linear set L is then defined as the maximal number
of vectors of L which are linearly independent with respect to S. It can easily
be proved [1, pp. 56-58] that L and Cl(L) always have the same rank and
that Cl(L) is the only closed linear set (called "Grundmodul" in [1]) which
contains L and has the same rank as L. Hence, a linear set is closed if and
only if it is not contained in a different linear set which has the same rank.
Finally, $\mathrm{Cl}(L) = L/\mathfrak{d}_{m-r}$ [1, Theorem III, p. 58].

The factor group $\mathrm{Cl}(L) - L$ is a module with a finite number of genera-
tors and with S as operator domain. With each set of generators of such a
"finite" module, a null space is associated which is the linear set consisting
of the vectors which annul the generator set. In [4, p. 197], the determinantal
factors of such a null space are proved to be independent of the underlying
generator set and are consequently called the determinantal factors of the
module. For the same reason, the invariant factors of these null spaces will
be called the invariant factors of the module. The first non-zero determinantal
factor of a module will be called the norm of the module. It is clear that
if Cl(L) has a linearly independent basis which is transformed by the matrix
A into a set of generators of L, the determinantal factors and invariant factors
of A and of $\mathrm{Cl}(L) - L$ are the same and, consequently, the norm of
$\mathrm{Cl}(L) - L$ is equal to the first non-zero determinantal factor of A. If S is a
principal ideal ring, any generator of the norm of $\mathrm{Cl}(L) - L$ will be con-
sidered as a norm and will be denoted by $\nu(\mathrm{Cl}(L) - L)$.

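Finally, if S is a Euclidean domain and L has rank r, $\mathfrak{e}_1 \subseteq \mathfrak{e}_2 \subseteq \cdots$ and
the generators $\delta_i$ and $\epsilon_i$ of the ideals $\mathfrak{d}_i$ and $\mathfrak{e}_i$ respectively can be chosen such
that $\delta_{m-j} = \prod_{t=r-j+1}^{r} \epsilon_t$ for $j = 1, \dots, r$. Since we can then choose (see [1,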

pp. 55, 56 and 60]) a linearly independent basis $u_1, \dots, u_r$ for Cl(L) such
that $\epsilon_1 u_1, \dots, \epsilon_r u_r$ is a basis for L, the norm of $\mathrm{Cl}(L) - L$ is the scalar $\delta_{m-r}$
and the non-zero determinantal factors and invariant factors of L are equal
to those of $\mathrm{Cl}(L) - L$. If S consists of polynomials in one variable with
coefficients in a field P, the degree of $\delta_{m-r}$ is equal to the number of restclasses
of $\mathrm{Cl}(L) - L$ which are linearly independent with respect to P. Furthermore,
bearing in mind that in the present notation the highest dimensional invariant
factor of L is called $\epsilon_1$, the following theorems of [1] are used: I [p. 57],
IV and V [p. 60].
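The relations between determinantal factors, invariant factors and the norm can be checked numerically in the integer analogue, where the absolute value of the norm plays the part of the degree and counts the restclasses of the factor group of Cl(L) by L. The following is a minimal sketch under that assumption; the generators `gens`, the basis `u` of the closure and the coordinate list `coords` are hypothetical data chosen for the example, not taken from the paper.

```python
from itertools import combinations, product
from math import gcd, prod


def det(rows):
    """Integer determinant by Laplace expansion along the first row."""
    if not rows:
        return 1
    return sum((-1) ** j * e * det([r[:j] + r[j + 1:] for r in rows[1:]])
               for j, e in enumerate(rows[0]) if e)


def minors_gcd(a, k):
    """gcd of all k-dimensional minors of the integer matrix a."""
    g = 0
    for rs in combinations(range(len(a)), k):
        for cs in combinations(range(len(a[0])), k):
            g = gcd(g, det([[a[r][c] for c in cs] for r in rs]))
    return g


gens = [[2, 4, 6], [0, 6, 12], [2, 10, 18]]  # generators of L in Z^3
m, r = 3, 2                                   # number of columns, rank of L

# Generators delta_i of the non-zero determinantal factors; delta_m = 1.
delta = {i: minors_gcd(gens, m - i) for i in range(m - r, m)}
delta[m] = 1
# Invariant factor generators eps_1, ..., eps_r as successive quotients.
eps = [delta[m - r + j - 1] // delta[m - r + j] for j in range(1, r + 1)]
# delta_{m-j} is the product of eps_t for t = r-j+1, ..., r.
for j in range(1, r + 1):
    assert delta[m - j] == prod(eps[r - j:])

# Assumed basis of Cl(L); it spans a closed set because its 2-rowed minors
# have gcd 1. coords lists the generators of L in terms of this basis.
u = [[1, 2, 3], [0, 1, 2]]
assert minors_gcd(u, r) == 1
coords = [(2, 0), (0, 6), (2, 6)]
for g, (x, y) in zip(gens, coords):
    assert g == [x * u[0][c] + y * u[1][c] for c in range(m)]

# Count the restclasses of Cl(L) modulo L by brute force: the norm
# annihilates the factor group, so a box of that side length suffices.
box = delta[m - r]
in_L = set()
for a, b, c in product(range(-box, box + 1), repeat=3):
    x = a * coords[0][0] + b * coords[1][0] + c * coords[2][0]
    y = a * coords[0][1] + b * coords[1][1] + c * coords[2][1]
    if 0 <= x < box and 0 <= y < box:
        in_L.add((x, y))
index = box * box // len(in_L)
print(eps, delta[m - r], index)  # [6, 2] 12 12
```

The brute-force count is deliberately independent of the invariant factors, so that the agreement between the norm and the number of restclasses is observed rather than assumed.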


2. Linear transformation of variables. Let $\bar{V}_m$ be an m-dimensional
vector space which consists of row vectors with m components and whose scalar
domain $\bar{S} = P[\eta_1, \dots, \eta_n]$ consists of polynomials in n variables $\eta_1, \dots, \eta_n$
with coefficients in a field P. If we adjoin the variables $y_{2,1}, \dots, y_{n,n-1}$ to P
(see [1, Section 3]), the scalar domain becomes $\bar{S}(y) = P(y)[\eta_1, \dots, \eta_n]$.
Furthermore every linear set $\bar{L}$ of $\bar{V}_m$ has then to be extended to the linear
set $\bar{L}(y)$ consisting of the vectors $\bar{t}(y) = \sum_j \omega_j(y)\bar{t}_j$, where $\omega_j(y)$ denotes an
arbitrary monomial of the variables $y_{i,j}$ and where the vector $\bar{t}_j$ is a vector
of $\bar{L}$ (disregarding multiplication by elements of P(y) as is done in [1, p. 62,
equations 13]). If we then transform the variables $\eta_1, \dots, \eta_n$ by the fixed,
linear, reversible transformation $\eta = U(\xi)$, that is $\eta_1 = \xi_1, \dots, \eta_n = y_{n,1}\xi_1 + \cdots
+ y_{n,n-1}\xi_{n-1} + \xi_n$ (see [1, p. 61]), every linear set $\bar{L}(y)$ is transformed into
a linear subset L of the m-dimensional vector space $V_m$ whose scalar domain
$S = P(y)[\xi_1, \dots, \xi_n]$ consists of polynomials in n variables $\xi_1, \dots, \xi_n$ with
coefficients in P(y). We shall call a linear set L of $V_m$ a transformed linear
set if L can be obtained from a linear set $\bar{L}$ of $\bar{V}_m$ by first adjoining the
variables $y_{i,j}$ to P and then transforming the $\eta_1, \dots, \eta_n$ by means of $\eta = U(\xi)$.
The criterion, expressed in [1, p. 62, equations 15], holds without change,
i.e. the linear set L is transformed if and only if in the expression
$v = U(U^{-1}(v)) = U(\sum_j \omega_j(y)\bar{t}_j) = \sum_j \omega_j(y)U(\bar{t}_j)$, where v is an arbitrary
vector of L, the vectors $U(\bar{t}_j)$ are again vectors of L.




We know from [1, Section 3] that these two processes of adjunction and
linear transformation replace the ideals of $\bar{S}$ by the transformed ideals of S,
which have the property of containing at least one polynomial which is regular
with respect to $\xi_1$. The following theorem is the main reason why the methods
of [1] are applicable to an arbitrary number of dimensions:


THEOREM 2.1. If the linear set L is the transform of the linear set $\bar{L}$,
the i-th determinantal factor $\mathfrak{d}_i$ of L is the transform of the i-th determinantal
factor $\bar{\mathfrak{d}}_i$ of $\bar{L}$ and the same holds for the invariant factors.


Proof. The i-th determinantal factor $\mathfrak{d}_i$ of L is generated by the $(m - i)$-
dimensional minors $\Delta$ of the matrices whose rows $v_1, \dots, v_m$ are vectors of L.
Since $U^{-1}(v_k) = \sum_j \omega_j(y)\bar{t}_j$, where $\bar{t}_j \equiv 0\,(\bar{L})$, $U^{-1}(\Delta) = \sum_j \omega_j(y)\bar{\Delta}_j$, where
$\bar{\Delta}_j$ is an element of $\bar{\mathfrak{d}}_i$. Consequently, $\mathfrak{d}_i$ is contained in the transform of $\bar{\mathfrak{d}}_i$.
However, every matrix whose rows are vectors of $\bar{L}$ is transformed by $\eta = U(\xi)$
into a matrix whose rows are vectors of L, and hence every element of the
ideal $\bar{\mathfrak{d}}_i$ is transformed into an element of the ideal $\mathfrak{d}_i$ which proves that the
transform of $\bar{\mathfrak{d}}_i$ is equal to $\mathfrak{d}_i$. The statement about the invariant factors then
follows immediately from [1, p. 77, Section 7] where it is proved that the
quotient of two transformed ideals is equal to the transform of the quotient
of these ideals.

As the transform of an ideal is the zero-ideal if and only if the ideal
itself is the zero-ideal, we have as a corollary of Theorem 2.1 that a linear
set and its transform have the same rank.


3. The n closures of a linear set. Let $V_m$ again be the m-dimensional
vector space with scalar domain $S = P(y)[\xi_1, \dots, \xi_n]$. We shall now con-
sider the denumerably infinite dimensional vector space $V^{(i-1)}$ with scalar
domain $S^{(i)} = P(y)[\xi_i, \dots, \xi_n]$ for $i = 1, \dots, n$. The fundamental vectors
of $V^{(i-1)}$ are the vectors $\prod_{s=1}^{i-1} \xi_s^{k_s} f_j$, where $\prod_{s=1}^{i-1} \xi_s^{k_s}$ is any monomial of the
variables $\xi_1, \dots, \xi_{i-1}$ (each $k_s$ can assume all positive integral values, zero
included) and where $f_j$ is an arbitrary fundamental vector of $V_m$ for
$j = 1, \dots, m$. The scalar domain $S^{(i)}$, whose scalars will be indicated by
the upper index i, consists of polynomials in $\xi_i, \dots, \xi_n$ with coefficients in
P(y), while the vectors of $V^{(i-1)}$ are linear combinations, with coefficients in
$S^{(i)}$, of the fundamental vectors where only a finite number of the coefficients
is different from zero.
Each vector v of $V_m$ is associated with a unique vector of $V^{(i-1)}$, which
we obtain by ordering the components of v with respect to the monomials of
the variables $\xi_1, \dots, \xi_{i-1}$. (See [1, p. 64].) This correspondence is clearly
an operator isomorphism $T_{i-1}$ with respect to $S^{(i)}$ as operator domain, which
associates a linear set $L_{i-1} = T_{i-1}(L)$ of $V^{(i-1)}$ with every linear set L of $V_m$.
To the closure $\mathrm{Cl}(L_{i-1})$ of $L_{i-1}$ there corresponds a linear subset of $V_m$ which
is called the $(i-1)$-th closure of L according to the following definition (see
[1, definition V]):


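DEFINITION 3.1. The $(i-1)$-th closure $\mathrm{Cl}_{i-1}(L)$ of a linear subset L
of $V_m$ consists of the vectors v of $V_m$ for which we can find a non-zero scalar
$\alpha^{(i)}$ of $S^{(i)}$ such that $\alpha^{(i)} v \equiv 0\,(L)$.


Consequently, we always have the relationships $T_{i-1}(\mathrm{Cl}_{i-1}(L)) = \mathrm{Cl}(L_{i-1})$
and $\mathrm{Cl}_i(L) \subseteq \mathrm{Cl}_{i-1}(L)$. Furthermore, $\mathrm{Cl}_0(L) = \mathrm{Cl}(L)$.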


THEOREM 3.1. The closures of a transformed linear set are themselves
transformed linear sets.


The proof of this theorem is the same as that of [1, Theorem VI] if the 
following extension of the theorem of Dedekind and Mertens to vectors is used 
(see [1, p. 63]): 


THEOREM OF DEDEKIND AND MERTENS. Let $a_{i_1 \dots i_n}$ be variables and let
$b_{j_1 \dots j_n}$ be vectors whose t components are variables and let the vectors $c_{k_1 \dots k_n}$
be defined by the product of the scalar polynomial and the vector polynomial


where the polynomials are of finite degree. Then, there exists an integer q
such that $\mathfrak{a}^{q+1}B = \mathfrak{a}^{q}C$, where $\mathfrak{a}$, B and C are the modules which consist of the
linear, rational integral combinations of the a's, b's and c's respectively and
where $\mathfrak{a}^{j}$ consists of the linear, rational integral combinations of the monomials
of degree j of the a's.


The reasoning of Dedekind's proof in [5], applied to each component
of the b's and the c's, proves this extension immediately.


4. The factor module $\mathrm{Cl}(L_{i-1}) - L_{i-1}$. Let L be a transformed linear
set of the same vector space $V_m$. Since $S^{(i)}$ is the scalar domain of the vector
space $V^{(i-1)}$, the factor module $\mathrm{Cl}(L_{i-1}) - L_{i-1}$ has $S^{(i)}$ as operator domain.
The following theorem shows that, although $V^{(i-1)}$ is an infinite dimensional
vector space, the factor module $\mathrm{Cl}(L_{i-1}) - L_{i-1}$ has a finite number of genera-
tors (see [1, Theorem VII]):


THEOREM 4.1. The factor module $\mathrm{Cl}(L_{i-1}) - L_{i-1}$ is operator isomorphic
to the finite factor module $\mathrm{Cl}(L'_{i-1}) - L'_{i-1}$ with respect to $S^{(i)}$ as operator
domain, where $L'_{i-1}$ and its closure $\mathrm{Cl}(L'_{i-1})$ are linear subsets of a finite
dimensional vector space $V'^{(i-1)}$ with $S^{(i)}$ as scalar domain. Hence, $\mathrm{Cl}(L_{i-1})
- L_{i-1}$ is a finite module whose determinantal factors and invariant factors
are equal to those of $\mathrm{Cl}(L'_{i-1}) - L'_{i-1}$. Clearly, these statements are still
valid if the variables $\xi_{i+1}, \dots, \xi_n$ are adjoined to P(y) and $S^{(i)}$ is replaced by
$P(y, \xi_{i+1}, \dots, \xi_n)[\xi_i]$.

Proof. The theorem is proved by induction. (See [1, pp. 64-70].) For
$i = 1$ the theorem is trivial since $V^{(0)}$ is equal to $V_m$ and $S^{(1)}$ is equal to S.
For $i = 2$ we observe that, as L is a transformed linear set, its highest di-
mensional non-zero determinantal factor is a transformed ideal (see Theorem
2.1) and consequently contains at least one polynomial which is regular with
respect to $\xi_1$. From here on, the proof is exactly the same as that of [1, pp.
68-69], assuming that $i = 2$. Hence, for i the following induction hypotheses
may be made: There exists an operator isomorphism $F_{i-1}$ with respect to $S^{(i)}$
as operator domain which associates $\mathrm{Cl}_{i-1}(L)$ with a linear subset of the
denumerably infinite vector space $W^{(i-1)}$ whose scalar domain is $S^{(i)}$. The
first s fundamental vectors of $W^{(i-1)}$ are denoted by $u_1, \dots, u_s$ and the re-
maining ones by $k_1, k_2, \dots$ ad infinitum, i.e. $W^{(i-1)} = (u_1, \dots, u_s, k_1, k_2, \dots$
ad infinitum). Furthermore, $F_{i-1}(\mathrm{Cl}_{i-1}(L)) = (\mathrm{Cl}(L'_{i-1}), K_{i-1})$ and
$F_{i-1}(L) = (L'_{i-1}, K_{i-1})$, where $L'_{i-1}$ is a linear subset of the space $V'^{(i-1)}$ generated by
the u's, and where $K_{i-1}$ denotes the space generated by the k's. Finally, if
instead of the linear transformation $\eta = U(\xi)$, we use the special linear trans-
formation $\eta = U'_{i-1}(\xi, \alpha)$, that is


the above hypotheses still hold, and the linear sets and decompositions which
then occur are transformed into the old ones by the linear transformation
$U'^{-1}_{i-1} \cdot U$. The theorem can then be proved immediately for i. (See [1, p. 67].)
In order to prove the theorem for $i + 1$, we first observe that the highest
dimensional non-zero determinantal factor of $L'_{i-1}$ contains at least one poly-
nomial $\pi^{(i)}$ which is regular with respect to $\xi_i$. This follows from the fact
that $L'_{i-1}$ is the transform with respect to $U'^{-1}_{i-1} \cdot U$ of the corresponding linear
set which occurs if we use $U'_{i-1}$ instead of U as the linear transformation.
Theorem 2.1 then asserts that the determinantal factors of $L'_{i-1}$ are trans-
formed ideals with respect to $U'^{-1}_{i-1} \cdot U$. The rest of the proof can be copied
from [1, pp. 68 and 69].




If the field P is infinite, the variables $y_{i,j}$ of the transformation $\eta = U(\xi)$
can be specialized as elements of P in such a way that the n polynomials $\pi^{(i)}$
$(i = 1, \dots, n)$ remain regular with respect to $\xi_i$. Consequently, Theorem
4.1 remains valid for such a specialization and the linear transformation
$\eta = U(\xi)$, which was used to transform the linear sets into transformed linear
sets, may then be considered as a transformation with coefficients in P.


5. Resultant and elementary divisor of a linear set. Let L again be a
transformed linear set and let the variables $\xi_{i+1}, \dots, \xi_n$ be adjoined to P(y).
The scalar domain of $V^{(i-1)}$ and $V'^{(i-1)}$ is then the Euclidean domain
$S'^{(i)} = P(y, \xi_{i+1}, \dots, \xi_n)[\xi_i]$
and the linear set $L'_{i-1}$ has the properties described at the end of 1. As the
polynomial $\pi^{(i)}$ of the previous section is a polynomial of the highest di-
mensional non-zero determinantal factor $\mathfrak{d}_{s-r}$ of $L'_{i-1}$, the generators $\delta_j$ and $\epsilon_j$
of the determinantal factors and invariant factors respectively of $L'_{i-1}$ can be
considered as polynomials of $S^{(i)}$ which are regular with respect to $\xi_i$. (See
[1, p. 71].) Denoting $\delta_{s-r}$ by $\rho^{(i)}$ and $\epsilon_1$ by $\epsilon^{(i)}$, we conclude from 1 that
$\rho^{(i)} = \prod_j \epsilon_j = \nu(\mathrm{Cl}(L'_{i-1}) - L'_{i-1})$, while $(\rho^{(i)}) = \mathfrak{d}_{s-r}$
and finally $(\epsilon^{(i)}) = L'_{i-1}/\mathrm{Cl}(L'_{i-1})$. (A scalar between parentheses denotes
the ideal generated by that scalar.) It follows from Theorem 4.1 that $\rho^{(i)}$
and $\epsilon^{(i)}$ can also be considered as the generators of the highest dimensional
non-zero determinantal factor and invariant factor respectively of $\mathrm{Cl}(L_{i-1}) - L_{i-1}$
and hence, $\rho^{(i)} = \nu(\mathrm{Cl}(L_{i-1}) - L_{i-1})$ and $(\epsilon^{(i)}) = L_{i-1}/\mathrm{Cl}(L_{i-1})$.
(See [1, p. 71].)

Let us now return to the scalar domain $S^{(i)}$ and again consider the
variables $\xi_{i+1}, \dots, \xi_n$ as not adjoined to P(y). The following definition is
the same as for ideals (see [1, Definition VI, p. 71]):


DEFINITION 5.1. The polynomials $\rho^{(i)}$ and $\epsilon^{(i)}$ are called the i-th re-
sultant and the i-th elementary divisor of the linear set L. The resultant $\rho$
and the elementary divisor $\epsilon$ are then defined as $\rho = \prod_{i=1}^{n} \rho^{(i)}$ and $\epsilon = \prod_{i=1}^{n} \epsilon^{(i)}$.


Theorem VIII in [1, p. 71] also holds for linear sets: 


THEOREM 5.1. The product $\prod_{j=i}^{n} \rho^{(j)}$ can be divided by $\prod_{j=i}^{n} \epsilon^{(j)}$ and both
these products are contained in $L/\mathrm{Cl}_{i-1}(L)$. Consequently, for $i = 1$, the
resultant $\rho$ of a linear set L can be divided by the elementary divisor $\epsilon$ and
both are contained in L/Cl(L). Finally, a power of $\epsilon$ can be divided by $\rho$.




Proof. The proof is the same as in [1, pp. 71-72] if we bear in mind that
$\mathrm{Cl}_0(L)$ is not necessarily the whole vector space as in the case of the ideals
but is equal to the ordinary closure, Cl(L), of L.

The next theorem was discussed in the introduction: 


THEOREM 5.2. If the linear set $L_1$ contains the linear set $L_2$, then they
are equal if and only if they have the same rank and the same resultant.


Proof. As $L_2 \subseteq L_1$, we know that $\mathrm{Cl}(L_2) \subseteq \mathrm{Cl}(L_1)$. Since the ranks
of $L_1$ and of $L_2$ are the same, we conclude that $\mathrm{Cl}(L_2) = \mathrm{Cl}(L_1)$ (see 1), i.e.
$\mathrm{Cl}_0(L_2) = \mathrm{Cl}_0(L_1)$. The proof then proceeds as for ideals. (See [1, Theorem
IX, p. 72].)

In the corresponding theorem on ideals the rank is not mentioned since 
all ideals have the same rank, namely 1. 

For a matrix $A = (\alpha_{ij})$, where the $\alpha_{ij}$ are elements of S and whose rows
generate a transformed linear set, the resultant is defined as the resultant of
that linear set. Considering only matrices whose rows generate transformed
linear sets, we have the following immediate corollary of Theorem 5.2:


THEOREM 5.3. A system of linear equations $\sum_i \alpha_{ij}\lambda_i = \gamma_j$ $(j = 1, \dots, m)$,
where the $\alpha_{ij}$ and $\gamma_j$ are elements of S, can be solved simultaneously for elements
$\lambda_i$ of S if and only if the matrix $A = (\alpha_{ij})$ and the augmented matrix, which
we obtain by adding the row $(\gamma_1, \dots, \gamma_m)$ to A, have the same rank and the
same resultant.
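The classical integer analogue of this criterion, in which the highest dimensional determinantal factor takes the place of the resultant as in the theorem quoted from [2] in the Introduction, is easy to test by machine. The sketch below is an illustration under that substitution, not the polynomial statement of Theorem 5.3 itself; the matrix `A`, the right-hand sides and all function names are assumptions made for the example.

```python
from itertools import combinations, product
from math import gcd


def det(rows):
    """Integer determinant by Laplace expansion along the first row."""
    if not rows:
        return 1
    return sum((-1) ** j * e * det([r[:j] + r[j + 1:] for r in rows[1:]])
               for j, e in enumerate(rows[0]) if e)


def minors_gcd(a, k):
    """gcd of all k-dimensional minors of the integer matrix a."""
    g = 0
    for rs in combinations(range(len(a)), k):
        for cs in combinations(range(len(a[0])), k):
            g = gcd(g, det([[a[r][c] for c in cs] for r in rs]))
    return g


def rank_and_top_factor(a):
    """Rank of a and the gcd of its minors of that dimension."""
    for k in range(min(len(a), len(a[0])), 0, -1):
        g = minors_gcd(a, k)
        if g:
            return k, g
    return 0, 1


def solvable_by_criterion(a, gamma):
    """Compare A with the matrix obtained by adding the row gamma."""
    return rank_and_top_factor(a) == rank_and_top_factor(a + [gamma])


def solvable_by_search(a, gamma, bound=5):
    """Look for integers x_i with sum_i x_i * a[i][j] == gamma[j] for all j."""
    for x in product(range(-bound, bound + 1), repeat=len(a)):
        if all(sum(xi * row[j] for xi, row in zip(x, a)) == gamma[j]
               for j in range(len(gamma))):
            return True
    return False


A = [[2, 4, 6], [0, 6, 12]]
cases = {
    "sum of the rows": [2, 10, 18],       # solvable with x = (1, 1)
    "half of the first row": [1, 2, 3],   # same rank, different factor
    "outside the row space": [1, 0, 0],   # the rank increases
}
for name, gamma in cases.items():
    print(name, solvable_by_criterion(A, gamma), solvable_by_search(A, gamma))
```

The rows of `A` are linearly independent, so any rational solution is unique and the small search box is enough to decide solvability in integers for these three right-hand sides.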


The following theorem and its proof are word for word the same as in 
[1, p. 73]: 

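THEOREM 5.4. Let $\rho^{(i)} = \alpha^{(i)}\beta^{(i)}$ be a factorization of the i-th resultant
of the linear set L into two relatively prime polynomials of $S^{(i)}$. Then, there
exist two unique linear sets A and B which both contain L such that
$\mathrm{Cl}_{i-1}(A) = \mathrm{Cl}_{i-1}(B) = \mathrm{Cl}_{i-1}(L)$ while after adjunction of $\xi_{i+1}, \dots, \xi_n$ to
P(y), $\nu(\mathrm{Cl}(A_{i-1}) - A_{i-1}) = \alpha^{(i)}$ and $\nu(\mathrm{Cl}(B_{i-1}) - B_{i-1}) = \beta^{(i)}$. Further-
more, $\mathrm{Cl}_i(L) = [A, B]$ and $A = \mathrm{Cl}_i(L/(\beta^{(i)}))$ and $B = \mathrm{Cl}_i(L/(\alpha^{(i)}))$.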


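6. The resultant $\rho$ and the ideal L/Cl(L). A system of i elements
$\bar{\xi}_1, \dots, \bar{\xi}_i$ of the algebraic closure $P'$ of $P(y, \xi_{i+1}, \dots, \xi_n)$ is called a root of
dimension $n - i$ of an ideal $\mathfrak{a}$ of S if for $\xi_1 = \bar{\xi}_1, \dots, \xi_i = \bar{\xi}_i$
the polynomials of $\mathfrak{a}$ vanish. From the theory of elimination
of $\mathfrak{a}$, as explained in [1, Chapter VI], we obtain the properties of the sequence
of ideals $\mathfrak{a}_n \subseteq \cdots \subseteq \mathfrak{a}_1$, which is uniquely determined by $\mathfrak{a}$. If we then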




take the ideal $L/\mathrm{Cl}_{i-1}(L)$ as the ideal $\mathfrak{a}$, we prove in the same way as in
[1, pp. 77-78] that the ideals $\mathfrak{a}_1, \dots, \mathfrak{a}_i$ are different from the zero-ideal and
that the i-th elementary divisor $\epsilon^{(i)}$ of L is a divisor of the highest common
factor of the polynomials of $\mathfrak{a}_i$. Since this remains true if we first transform
the variables by the special transformation, = — & 
and adjoin the $\alpha$'s to P(y), and as furthermore a power of $\epsilon^{(i)}$ can be divided
by the i-th resultant $\rho^{(i)}$ of L, Theorem XIII of [1, p. 78] holds for linear sets:


THEOREM 6.1. Every root $\bar{\xi}_i \in P'$ of the i-th resultant $\rho^{(i)}$ of the linear
set L can be extended in a unique way into a root of dimension $n - i$ of
$L/\mathrm{Cl}_{i-1}(L)$, and consequently of L/Cl(L). If we use the above special trans-
formation, the polynomial $\rho^{(i)}$ is replaced by the polynomial $\rho^{(i)}(\xi, \alpha)$, which
can be factored in $P'(\alpha)$ as follows:

Here, &;,° represent all the roots of dimension n—i of L/Cl;_.(L), 
which we obtain in the above way from the linear factors of $\rho^{(i)}$. This fac-
torization remains valid if the variables are replaced by elements
of P(y).


As $\rho = \prod_{j=1}^{n} \rho^{(j)} \equiv 0\,(L/\mathrm{Cl}(L))$, every root of L/Cl(L) is the extension
of a root of one of the factors $\rho^{(i)}$ of $\rho$. Theorem 6.1 asserts the converse,
namely that every root of a factor $\rho^{(i)}$ of $\rho$ can be extended to a root of
L/Cl(L). Consequently the "main results" in the introduction are com-
pletely proved.


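Theorem 5.4 asserts that the factorization of $\rho^{(i)}$ into powers of irre-
ducible polynomials of $S^{(i)}$, $\rho^{(i)} = \prod_{j=1}^{t} \alpha_j$, gives rise to a unique decomposition
$\mathrm{Cl}_i(L) = [A^{(1)}, \dots, A^{(t)}]$ such that after adjunction of $\xi_{i+1}, \dots, \xi_n$ to
P(y), $\nu(\mathrm{Cl}(L_{i-1}) - A^{(j)}_{i-1}) = \alpha_j$. Hence, the number of restclasses of the
factor group $\mathrm{Cl}(L_{i-1}) - A^{(j)}_{i-1}$ which are independent with respect to the field
$P(y, \xi_{i+1}, \dots, \xi_n)$ is equal to the multiplicity of the factor $\alpha_j$ of $\rho^{(i)}$. (See
the Introduction and 1.)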

Finally, for the reasons explained in [1, p. 79], if the field P is infinite,
the variables $y_{i,j}$ can be specialized as elements of P in such a way that all
the above results remain valid. Hence, in that case, the linear transformation
$\eta = U(\xi)$, which was used to transform the linear sets into transformed linear
sets, may be considered as a transformation with coefficients in the field P.




REFERENCES. 


1. K. Hentzelt and E. Noether, "Zur Theorie der Polynomideale und Resultanten,"
Mathematische Annalen, vol. 88 (1922), pp. 53-79.

2. E. Steinitz, "Rechteckige Systeme und Moduln in algebraischen Zahlkörpern I,"
Mathematische Annalen, vol. 71 (1912), p. 340.

3. E. Snapper, "Structure of Linear Sets," Transactions of the American Mathe-
matical Society, vol. 52 (1942), pp. 258-259.

4. H. Fitting, "Die Determinantenideale eines Moduls," Jahresberichte der
Deutschen Mathematiker Vereinigung, vol. 46 (1936), p. 197.

5. R. Dedekind, "Über einen arithmetischen Satz von Gauss," Dedekind Gesam-
melte mathematische Werke II (1931), pp. 35-38.


PRINCETON UNIVERSITY, 
PRINCETON, N. J. 


r-REGULAR CONVERGENCE SPACES.* 


By PAUL A. WHITE.


In most previous work on hyperspaces such as the space of all closed
subsets or of all continua in a given space M, the well known Hausdorff metric
has been used. In this work the hyperspace $K^r$ of $lc^r$ closed subsets of a com-
pact metric space M, where r is an arbitrary integer, is considered. (See
Definition 1.2.) It is easily seen that the Hausdorff metric, which makes the
limit concept in the hyperspace correspond to ordinary point set convergence
in the original space, is not desirable here. For example, $K^r$ would not be
closed in the space of all closed subsets of M. However G. T. Whyburn has
introduced the notion of regular convergence (see Definition 1.1) which does
require that the limit set be $lc^r$. The major part of this work is devoted to
establishing a metric in $K^r$ where regular convergence is used to define the
limit concept in $K^r$.

After the main part of this work was completed it was learned that S.
Mazurkiewicz¹ had considered the hyperspace of locally connected subcontinua
of a locally connected compact metric space M. This is what is called the
set $K_c^0$ in this work where $r = 0$ (see Definition 7.2). Also we require only
that M be a compact metric space. It is interesting to note that the limit
notion used by Mazurkiewicz is equivalent to ours although phrased differently.
It should be mentioned however that his metric is complete whereas ours is
only complete in very special cases. In this respect his is more desirable than
ours although defined in a more restricted space.


A large part of the work is devoted to finding rather general topological
properties of $K^r$. Some time is spent in considering what special properties
$K^r$ will have if M is specialized and conversely. It is found that very general
hypotheses on $K^r$ or $K_c^0$ demand very special kinds of spaces for M. It is
hoped that through the study of $K^r$ many properties can be deduced that could
not be obtained from the usual hyperspaces. This hope is based on the fact 
that not so many properties of the original space are carried over to our 
hyperspace as is true in the case of the usual ones. Thus many more essen- 
tially different things can arise in the hyperspace that did not appear in the 
original space. For example, the hyperspace almost always has more com-
ponents than the original space. Thus certain collections of subsets are seen
to be related that might not have been seen previously.


* Received June 14, 1942.

¹ Fundamenta Mathematicae, vol. 24 (1935), pp. 118-134.


1. The definition of convergence and the L* axioms. In this entire
work, we shall assume that our space M is compact and metric. All of our
ordinary complexes and cycles shall have modulus two coefficients and the
Vietoris cycles used shall consist of these as coordinate cycles. A knowledge
of the definitions and fundamental properties concerning Vietoris cycles will
be assumed and the words "complete cycle" or just "cycle" will be used
throughout to mean "Vietoris cycle." For a treatment of these combinatorial
concepts reference is made to the original paper of Vietoris.²


DEFINITION 1.1. The sequence of closed sets $[A_i]$ will be said to con-
verge r-regularly to A, provided $[A_i]$ converges to A ($[A_i] \to A$), and that
for every $\epsilon > 0$ there shall exist a $\delta > 0$ and an N such that if $n > N$, any
r-dimensional cycle in $A_n$ of diameter $< \delta$ is homologous to 0 ($\sim 0$) in a
subset of $A_n$ of diameter $< \epsilon$. (By the diameter of a complete cycle, we mean
the smallest diameter of a carrier of the cycle. A carrier of a cycle V is a
closed set P such that V is a cycle of P.)

DEFINITION 1.2. A closed set A is said to be locally $\gamma^r$-connected (r-lc)
provided that for every $\epsilon > 0$ there shall exist a $\delta > 0$ such that every r-
dimensional cycle of A of diameter $< \delta$ is homologous to 0 in a subset of A
of diameter $< \epsilon$. A closed set is $lc^r$ if it is s-lc for all s $(0 \le s \le r)$.

Notation. We shall denote by H the hyperspace of all closed subsets of M,
and by $K^r$ the hyperspace of all closed subsets that are $lc^r$. We shall think
of r as arbitrary but fixed until otherwise stated, so that we shall write K
instead of $K^r$ henceforth. A capital letter will denote a closed subset con-
stituting an element of K, and the same small letter will denote the point
of K corresponding to this set. The symbol "$\xrightarrow{r}$" shall denote r-regular
convergence, and "$\xrightarrow[\le r]{}$" shall denote s-regular convergence for all $s \le r$. $\delta(A)$
shall denote the diameter of the set A and $U_\epsilon(A)$ shall denote the set of all
points whose distance from some point in A is $< \epsilon$. Finally C shall denote
the boundary of the complex C.

DEFINITION 1.3. The sequence $[p_i]$ in H shall be said to converge to a
limit p in H ($[p_i] \to p$), if and only if $[P_i] \xrightarrow[\le r]{} P$.


THEOREM 1.1. Any infinite subsequence of a convergent sequence in H 
is a convergent sequence with the same limit. 


² Vietoris, Mathematische Annalen, vol. 97 (1927), pp. 454-472.


The proof is a direct application of the definition of convergence. 
THEOREM 1.2. If $p_i = p$ for all i, then $[p_i] \to p$ if and only if P is $lc^r$.


Proof. Clearly the definitions of s-regular convergence and local $\gamma^s$-
connectedness coincide when $p_i = p$ for all i; hence the theorem.


THEOREM 1.3. Each point of H will converge to itself in the sense of
Theorem 1.2, if and only if M is finite.


Proof. The sufficiency follows immediately from Theorem 1.2, for if M
is finite then every subset is both closed and $lc^r$.

Conversely, suppose that H has the property of the theorem, but that M
is infinite. Since M is compact there exists an infinite convergent sequence
of distinct points $[P_i] \to P$. Let A be the closed set $[P_i] + P$. By Theorem
1.2 A should be $lc^r$, but it is not since $P_i + P$ $(i = 1, 2, \dots)$ is a sequence
of 0-cycles whose diameters converge to 0, but which do not bound in A.


COROLLARY 1.21. Every point of K converges to itself in the sense of
Theorem 1.2.


Proof. The proof follows immediately from the definition of K and 
Theorem 1. 2. 

Note. Theorem 1.3 and Corollary 1.21 make it seem wise to abandon 
the study of H in general and restrict ourselves only to K. 


THEOREM 1.4. If $[p_i]$ in H does not converge to p, there exists an
infinite subsequence $[p_{n_i}]$ such that no subsequence of it converges to p.


Proof. Case 1. $[P_i] \to P$ but not s-regularly for some $s \le r$. This
means there exists an $\epsilon > 0$ such that for any $\delta > 0$ and N, there is an $n > N$
and a complete s-dimensional cycle $\gamma_n^s$ in $P_n$ with $\delta(\gamma_n^s) < \delta$ but $\gamma_n^s$ not
homologous to 0 in a subset of $P_n$ of diameter $< \epsilon$. Pick $[\delta_i] \to 0$ and $[N_i]
\to \infty$, and let $n_i > N_i$ be chosen as above. Now the sequence $[P_{n_i}]$ clearly
contains no s-regular convergent subsequence so that $[p_{n_i}]$ has no convergent
subsequence.

Case 2. $[P_i]$ does not converge to P. Since M is compact there exists
an $\epsilon > 0$ and an infinite subsequence $[P_{n_i}]$ such that each $P_{n_i}$ has the property
that either $P \not\subset U_\epsilon(P_{n_i})$ or $P_{n_i} \not\subset U_\epsilon(P)$. Clearly no subsequence of $[P_{n_i}]$ con-
verges to P and hence no subsequence of $[p_{n_i}]$ converges to p.


THEOREM 1.5. $K$ is an $L^*$-space of Kuratowski.




Proof. Kuratowski has given the name "$L^*$-space" to any space in which a convergence notion is defined such that the properties in Theorem 1.1, Corollary 1.21, and Theorem 1.4 hold.


2. Open sets and their relation to the limit concept. Henceforth we shall consider only the space $K$ and all notions such as complement of a set $E$ (denoted by $C(E)$) will be relative to $K$ unless otherwise stated.


DEFINITION 2.1. We shall call $p$ a limit point of the set $E$ contained in $H$ provided there exists an infinite sequence of distinct points of $E$ converging to $p$ as a limit.

DEFINITION 2.2. A set is closed if it contains all of its limit points. 

DEFINITION 2.3. A set is open if its complement is closed. 


The following lemmas, which we now prove, will be useful in proving the following theorems.


LEMMA 2.1. If $A$ is $lc^r$, then for every set of numbers $e$, $d$, $\eta$, $\rho$ such that $e > d > 0$, $\eta > 0$, $\rho > 0$ there exists a $\delta > 0$, such that for every $\delta$-cycle $C^r$ of $A$ of diameter $< d$, there is a complete cycle $D^r = (D_1^r, D_2^r, \cdots)$ of diameter $<$ minimum $(3d, e)$ such that $D_j^r \sim_\eta C^r$ for all $j$ in an $e$-subset of $A$. Also this homology takes place in the $\rho$-neighborhood of $C^r$, which therefore contains $D^r$.


Proof. This lemma follows directly from Lemma 1 of a paper by R. L. Wilder.³ If we choose $\alpha < \eta$, $(e - d)/2$, $d$, $\rho$ then by that lemma there exists a number $\delta > 0$ such that every $\delta$-simplex has a Vietoris chain realization of diameter $< \alpha$. It follows from the definition given in that paper that when we add these realizations for all simplices of $C^r$ (taking subsequences when necessary), we obtain a Vietoris cycle $D^r = (D_1^r, D_2^r, \cdots)$ which clearly has diameter $< d + 2\alpha < d + 2d = 3d$, or $d + 2(e - d)/2 = e$. Also by using the prism construction,⁴ we see that for each $j$, $D_j^r \sim_\alpha C^r$, hence $D_j^r \sim_\eta C^r$. Finally we recall that the prism construction requires no new vertices; hence the homology occurs in the $\alpha$- and hence the $\rho$-neighborhood of $C^r$.
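The diameter estimate in this proof can be checked line by line. The following is a sketch, assuming only that the auxiliary number (written $\alpha$ here) is chosen smaller than each of $d$ and $(e-d)/2$, as in the proof:

```latex
\begin{align*}
\delta(D^r) &< d + 2\alpha, \\
\alpha < d &\implies d + 2\alpha < d + 2d = 3d, \\
\alpha < \tfrac{e - d}{2} &\implies d + 2\alpha < d + (e - d) = e,
\end{align*}
so that $\delta(D^r) < \min(3d,\, e)$.
```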


LEMMA 2.2. If $[A_i] \to A$, where each $A_i$ is closed, and $e > 0$, $d > 0$ are numbers such that if $\gamma^s \subset A$ is a complete cycle of diameter $\le d$, then $\gamma^s \sim 0$ in a subset of $A$ of diameter $< e$, then for any $\epsilon > 0$ there is a $\delta > 0$, $e_1 < e$, and $N$ such that if $n > N$, any $s$-dimensional $\delta$-cycle in $A_n$ of diameter $\le d$ is $\sim_\epsilon 0$ in an $e_1$-subset of $A_n$.


³ R. L. Wilder, Duke Mathematical Journal, vol. 1 (1935), p. 546.
⁴ Alexandroff-Hopf, Topologie, p. 199.


Proof. If the lemma were false, then for some $\epsilon > 0$, there would exist sequences $[\delta_i] \to 0$, $[e_1^i] \to e$, $[N_i] \to \infty$, and a $\delta_i$-cycle $C_i^s$ in $A_{N_i}$ of diameter $\le d$ which is not $\epsilon$-homologous to 0 in a subset of $A_{N_i}$ of diameter $< e_1^i$. Pick a convergent subsequence of the point sets $[C_i^s]$ converging to a limit set $C$ in $A$. $\delta(C_i^s) \le d$ for all $i$, hence $\delta(C) \le d$. We can further suppose that the subsequence $[C_i^s]$ was so chosen that $U_{\delta_i}(C_i) \supset C$, and $U_{\delta_i}(C) \supset C_i$ for all $i$. Let $C_j^s = (x_1, \cdots, x_g)$, where the $x_i$ are the vertices of $C_j^s$. For each $i \le g$, let $y_i$ be a point of $C$ such that $\rho(x_i, y_i) < \delta_j$. For each simplex $(x_{i_0}, \cdots, x_{i_s})$ in $C_j^s$, let $(y_{i_0}, \cdots, y_{i_s})$ be a simplex and let $D_j^s$ be the cycle composed of all these simplexes.⁶ Clearly $D_j^s$ is a $3\delta_j$-cycle and $\delta(D_j^s) \le d$. Since $C$ is compact and the meshes $[3\delta_j] \to 0$, we can pick a subsequence of the $[D_j^s]$ so as to form a Vietoris cycle. Call this cycle $D^s = (D_1^s, D_2^s, \cdots)$. By hypothesis $D^s \sim 0$ in a subset of $A$ of diameter $< e$. That is, $D^s \sim 0$ in a subset whose diameter $e'$ is $< e$. Hence there is an $N'$ such that if $i > N'$, $D_i^s = \partial E_i^{s+1}$, where $E_i^{s+1}$ is an $\epsilon/3$-complex and $\delta(E_i^{s+1}) \le e'$. Pick $k$ such that $e - e_1^k < (e - e')/3$, $k > N'$ and $\delta_k < \min(2(e - e')/9,\ \epsilon/3)$. Project $E_k^{s+1}$ into $A_{N_k}$, keeping vertices of $C_k^s$ fixed. Let the resultant complex be $F_k^{s+1}$; then $\delta(F_k^{s+1}) < e' + 3 \cdot (2/9)(e - e') = e' + (2/3)(e - e')$. But $e - e_1^k < (e - e')/3$, or $(2/3)(e - e') + e' < e_1^k$, and $\delta(F_k^{s+1}) < e_1^k$. Also $F_k^{s+1}$ is an $(\epsilon/3 + \epsilon/3 + \epsilon/3) = \epsilon$-complex, and $\partial F_k^{s+1} = C_k^s$. But this says that $C_k^s \sim_\epsilon 0$ in a subset of $A_{N_k}$ of diameter $< e_1^k$ contrary to our assumption; hence the lemma must be true.
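The two arithmetic steps that close this proof are written out below. This is a sketch under the notation assumed here: $e'$ is the diameter of the subset of $A$ in which the limit cycle bounds, and $e_1^k$ is the $k$-th term of the sequence of diameters tending to $e$.

```latex
\begin{align*}
e' + 3\cdot\tfrac{2}{9}(e - e') &= e' + \tfrac{2}{3}(e - e'), \\
e - e_1^k < \tfrac{1}{3}(e - e')
  &\implies e_1^k > e - \tfrac{1}{3}(e - e') = e' + \tfrac{2}{3}(e - e'),
\end{align*}
so the projected complex has diameter less than $e_1^k$.
```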


LEMMA 2.3. If $[A_i] \xrightarrow[\le r]{} A$, where each $A_i$ is closed and $lc^{r-1}$, and $\epsilon > \delta_r > 0$ are numbers such that every complete $r$-dimensional cycle $\gamma^r$ in $A$ with $\delta(\gamma^r) \le \delta_r$ is homologous to 0 in a subset of $A$ of diameter $< \epsilon$, then for every $\sigma > 0$ there is an $N$ such that if $i > N$ and $\gamma^r$ is a complete $r$-dimensional cycle of $A_i$ of diameter $\le \delta_r$, then $\gamma^r \sim 0$ in a subset of $A_i$ of diameter $< \epsilon + \sigma$.


Proof. Suppose the lemma is false; then there is a $\sigma > 0$, a sequence $[n_i] \to \infty$, and a sequence of complete cycles $[\gamma_i^r]$ in $[A_{n_i}]$ such that $\delta(\gamma_i^r) \le \delta_r$, but $\gamma_i^r$ is not homologous to 0 in a subset of $A_{n_i}$ of diameter $< \epsilon + \sigma$. Thus if $\gamma_i^r = (C_{i1}^r, C_{i2}^r, \cdots)$ then for each $i$ there is a number $\eta_i > 0$ such that for only a finite number do we have $C_{ij}^r \sim_{\eta_i} 0$ in a subset of $A_{n_i}$ of diameter $< \epsilon + \sigma$. We can omit this finite number and suppose that $\gamma_i^r$ consists of only the remainder.


⁵ This is a modification of Lemma (1.2), G. T. Whyburn, Fundamenta Mathematicae, vol. 25 (1935), p. 410.
⁶ We shall call a complex $D_j^s$ obtained this way, a $\delta_j$-projection of $C_j$ in $C$.

The $r$-regular convergence tells us that there are positive numbers $N_r$ and $d_r < \sigma/4$ such that if $n > N_r$ and if $\gamma^r$ is a complete cycle of $A_n$ of diameter $< 3d_r$, then $\gamma^r \sim 0$ in a subset of $A_n$ of diameter $< \sigma/4$. Inductively for each $d_j > 0$ and $j = r, r-1, \cdots, 1$ there is a $d_{j-1} < d_j$ and a number $N_{j-1}$ such that if $n > N_{j-1}$ and if $\gamma^{j-1}$ is a cycle of $A_n$ of diameter $< 3d_{j-1}$, then $\gamma^{j-1} \sim 0$ in a subset of $A_n$ of diameter $< d_j$. Now in Lemma 2.2 let $d = \delta_r$, $e = \epsilon$, and $\epsilon = d_0$; then the conclusion is that there are numbers $N'$, $\delta < \delta_r$, and $\epsilon_1 < \epsilon$ such that for $n > N'$ any $r$-dimensional $\delta$-cycle in $A_n$ of diameter $\le \delta_r$ is $d_0$-homologous to 0 in a subset of $A_n$ of diameter $< \epsilon_1$.

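Choose $N = N' + \sum_{i=0}^{r} N_i$, and consider any $n_k > N$. In Lemma 2.1 let $A = A_{n_k}$, $e = \epsilon$, $d = (\delta_r + \epsilon)/2$, $\eta = \eta_k$, $\rho = \sigma/2$; then the conclusion tells us there is a $\delta' > 0$ such that for any $\delta'$-cycle $C^r$ of $A_{n_k}$ of diameter $< (\delta_r + \epsilon)/2$, there is a complete cycle $D^r = (D_1^r, D_2^r, \cdots)$ of diameter $< \epsilon$ such that $D_j^r \sim_{\eta_k} C^r$ for all $j$ in a subset of $A_{n_k}$ of diameter $< \epsilon$ and that the homology occurs in the $\sigma/2$-neighborhood of $C^r$. Since $\gamma_k^r = (C_{k1}^r, C_{k2}^r, \cdots)$ is a complete cycle, we can find a $C_{kl}^r$ which is both a $\delta$- and a $\delta'$-cycle. Also by hypothesis $\delta(C_{kl}^r) \le \delta_r < (\delta_r + \epsilon)/2$. Since $C_{kl}^r$ is a $\delta$-cycle, $C_{kl}^r = \partial E^{r+1}$ where $E^{r+1}$ is a $d_0$-complex of diameter $< \epsilon_1$. Since $C_{kl}^r$ is also a $\delta'$-cycle, we can find a complete cycle $D^r = (D_1^r, D_2^r, \cdots)$ such that $D_j^r \sim_{\eta_k} C_{kl}^r$ in a subset of $A_{n_k}$ of diameter $< \epsilon$ and that $D_j^r \subset$ the $\sigma/2$-neighborhood of $C_{kl}^r$.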

Since $E^{r+1}$ is a $d_0$-complex, it is clear by the choice of the numbers $d_i$ that a Vietoris chain realization of $E^{r+1}$ can be constructed.⁷ Furthermore since $\partial E^{r+1} = C_{kl}^r$ the realization of $C_{kl}^r$ as a Vietoris cycle $D^r = (D_1^r, D_2^r, \cdots)$, as concluded in Lemma 2.1, can be carried out simultaneously with that of $E^{r+1}$ in such a way that the realization of each simplex of $C_{kl}^r$ is used in the realization of the $(r+1)$-dimensional simplices of $E^{r+1}$ to which it belongs. Let $(E_1^{r+1}, E_2^{r+1}, \cdots)$ be the V-chain realization of $E^{r+1}$, which will have diameter $< \epsilon_1 + 2(\sigma/4) = \epsilon_1 + \sigma/2$. Furthermore the simultaneous realizations imply $\partial E_j^{r+1} = D_j^r$ for all $j$ where $\delta(E_j^{r+1}) < \epsilon_1 + \sigma/2$. We can pick $j$ such that $E_j^{r+1}$ is an $\eta_k$-complex; then $D_j^r \sim_{\eta_k} 0$ in a subset of $A_{n_k}$ of diameter $< \epsilon_1 + \sigma/2$. Also $D_j^r \sim_{\eta_k} C_{kl}^r$ in a subset of $A_{n_k}$ in the $(\sigma/2)$-neighborhood of $C_{kl}^r$. Therefore $C_{kl}^r \sim_{\eta_k} 0$ in a subset of $A_{n_k}$ of diameter $< \epsilon_1 + \sigma/2 + \sigma/2 = \epsilon_1 + \sigma < \epsilon + \sigma$. But this is contrary to the definition of $\gamma_k^r$ and the lemma is proved.


⁷ See reference ³, p. 0465.


LEMMA 2.4. If $[P_{ij}] \xrightarrow[\le r]{} P_i$, $[P_i] \xrightarrow[\le r]{} P$ where each $P_{ij}$ is closed and $lc^r$, then there is a diagonal sequence of the $[P_{ij}]$ which converges $t$-regularly to $P$ for all $t \le r$.

Proof. It will first be useful to generalize Lemma 2.3 by applying it for all $s \le r$. That is, if $\epsilon > \delta_s > 0$ are numbers such that every complete cycle $\gamma^s$ in $A$ of diameter $\le \delta_s$ bounds in a subset of $A$ of diameter $< \epsilon$, then for any $\sigma > 0$ there is an $N_s$ such that if $i > N_s$, and $\gamma_i^s$ is a complete $s$-dimensional cycle in $A_i$ with $\delta(\gamma_i^s) \le \delta_s$, then $\gamma_i^s \sim 0$ in a subset of $A_i$ of diameter $< \epsilon + \sigma$. Let $\delta = $ minimum $(\delta_0, \cdots, \delta_r)$ and $N = \sum_{s=0}^{r} N_s$; then if $i > N$ and $\gamma$ is any complete cycle in $A_i$ of dimension $\le r$ and diameter $\le \delta$, we have $\gamma \sim 0$ in a subset of $A_i$ of diameter $< \epsilon + \sigma$.

Since $[P_i] \xrightarrow[\le r]{} P$, we can find for each $\epsilon_k > 0$ in a sequence $[\epsilon_k] \to 0$, numbers $\delta_k > 0$ and $N_k$ such that if $i > N_k$ and $\gamma^s \subset P_i$ is a complete cycle of diameter $\le \delta_k$ for any $s \le r$, then $\gamma^s \sim 0$ in a subset of $P_i$ of diameter $< \epsilon_k$. By the above generalization of Lemma 2.3, if $\sigma = \epsilon_k$ there is a number $M_i^k$ for each $i > N_k$, such that if $j > M_i^k$ and $\gamma^s$ is a complete cycle in $P_{ij}$ for any $s \le r$ with diameter $\le \delta_k$, then $\gamma^s \sim 0$ in an $(\epsilon_k + \epsilon_k) = 2\epsilon_k$-subset of $P_{ij}$. Also there are numbers $R_i^k$ such that if $j > R_i^k$ then $U_{\epsilon_k}(P_i)$ contains $P_{ij}$, and $U_{\epsilon_k}(P_{ij})$ contains $P_i$. For each $i$, $N_k < i \le N_{k+1}$, pick a number $n_i > \sum_{m=1}^{k} R_i^m + \sum_{m=1}^{k} M_i^m$. Then $[P_{in_i}] \xrightarrow[\le r]{} P$, for if we consider any $\epsilon > 0$ there is a number $m$ such that $\epsilon_m < \epsilon/2$. Let $i > N_m$; then $n_i > R_i^m$, $U_\epsilon(P_i)$ contains $P_{in_i}$, and $U_\epsilon(P_{in_i})$ contains $P_i$ since $\epsilon_m < \epsilon/2$. Also since $n_i > M_i^m$ we know that if $\gamma^s$ is any complete cycle in $P_{in_i}$ for any $s \le r$ of diameter $< \delta_m$, then $\gamma^s \sim 0$ in a subset of $P_{in_i}$ of diameter $< 2\epsilon_m < 2 \cdot (\epsilon/2) = \epsilon$. The first of these statements implies that $[P_{in_i}] \to P$, while the second implies that the convergence is $t$-regular for all $t \le r$.
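The diagonal selection used in this proof has a simple numerical analogue. The sketch below is an illustration only: it replaces sets by numbers, using the hypothetical double sequence $x(i,j) = 1/i + 1/j$, which converges to $1/i$ as $j$ grows, while $1/i$ converges to 0. For each $i$ an index $n_i$ is chosen far enough along the $i$-th sequence that the error is below a tolerance that shrinks with $i$, so the diagonal terms converge to the final limit.

```python
from fractions import Fraction

def x(i, j):
    """Hypothetical double sequence: x(i, j) -> 1/i as j grows, and 1/i -> 0."""
    return Fraction(1, i) + Fraction(1, j)

def diagonal_index(i):
    """Smallest j with |x(i, j) - 1/i| < 1/i, found by direct search."""
    j = 1
    while abs(x(i, j) - Fraction(1, i)) >= Fraction(1, i):
        j += 1
    return j

indices = [diagonal_index(i) for i in range(1, 51)]
diagonal = [x(i, n_i) for i, n_i in enumerate(indices, start=1)]
```

Here the tolerance $1/i$ plays the role of the numbers $\epsilon_k$ in the proof, and the search for `diagonal_index(i)` plays the role of choosing $n_i$ beyond the thresholds $R_i^m$ and $M_i^m$.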


THEOREM 2.5. For any set $E \subset K$, $E'$ (and hence $\bar E$) is closed.


Proof. Consider any point $p$ in $(E')'$. By the definition of a limit point, there exists a sequence $[p_i]$ of distinct points in $E'$ such that $[p_i] \to p$. Since $p_i \in E'$, there is a sequence $[p_{ij}]$ of distinct points such that $[p_{ij}] \to p_i$. That is to say, $[P_i] \xrightarrow[\le r]{} P$, $[P_{ij}] \xrightarrow[\le r]{} P_i$; hence by Lemma 2.4 there is a diagonal sequence $[P_{in_i}] \xrightarrow[\le r]{} P$ or $[p_{in_i}] \to p$ where $[p_{in_i}]$ is in $E$ and may be assumed to consist of distinct points. This means $p$ is a limit point of $E$ and $E'$ is closed.




THEOREM 2.6. A necessary and sufficient condition that p be a limit 
point of a set E is that every open set containing p shall contain at least one 
point of E different from p. 


Proof. To prove the necessity suppose that $p$ is in $E'$. Then there exists a sequence of distinct points $[p_i] \to p$ such that each $p_i \in E$ and $p_i \ne p$ for all $i$. Now suppose that there is an open set $U$ containing $p$, but containing no point of $E$ different from $p$. In particular $[p_i] \subset C(U)$ which is closed, and hence contains $p$ contrary to the assumption that $p$ is in $U$.

Conversely, suppose that every open set containing $p$ contains a point of $E$ distinct from $p$, but that $p \notin E'$. That is, there does not exist an infinite sequence of distinct points of $E$ converging to $p$. This implies that $p + C(\bar E)$ is open, for $C(p + C(\bar E)) = C(p) \cdot \bar E$ is closed since $\bar E$ is closed by Theorem 2.5 and $p$ is not a limit point of $E$. Thus $p + C(\bar E)$ is an open set containing $p$, but not containing any point of $E$ distinct from $p$, contrary to the hypothesis. Hence $p \in E'$, which concludes the sufficiency proof.


3. The Hausdorff axioms. 

THEOREM 3.1. The space $K$ is closed in $H$ (in fact $H' \subset K$).

Proof. This theorem is a direct consequence of a theorem of G. T. Whyburn,⁸ for consider a sequence $[p_i]$ in $H$ such that $[p_i] \to p$. This means that $[P_i] \xrightarrow[\le r]{} P$, where the $[P_i]$ are all closed, but the above mentioned theorem states that under these conditions $P$ is closed and $lc^r$, or $p$ is in $K$. Thus, in particular, $H' \subset K$, and $K$ is closed in $H$.

THEOREM 3.2. K is open (in H). 

Proof. The complement of $K$ is the null set which is closed; hence $K$ is open.

THEOREM 3.3. The product of any two open sets is open. 

Proof. Evidently it is sufficient to prove that the sum of two closed sets is closed. This follows from the fact that a limit point of the sum of two sets $C_1$, $C_2$ is by Definition 2.1 the limit of an infinite sequence of distinct points of $C_1 + C_2$ and hence, by virtue of Theorem 1.1, a limit point of $C_1$, say.


DEFINITION 3.1. If $A \subset M$, then by $h_K(A)$ we mean the subset of $K$ corresponding to all subsets of $A$ which are closed and $lc^r$.


⁸ American Journal of Mathematics, vol. 57 (1935), p. 904.

THEOREM 3.4. If $A$ is open, so is $h_K(A)$.


Proof. If the theorem were false, then there would be a point $p$ in $h_K(A)$ and a sequence $[p_i]$ in the complement of $h_K(A)$ converging to $p$. This means that $P$ is in $A$ and $[P_i] \xrightarrow[\le r]{} P$, but $P_i \cdot C(A) \ne 0$ for all $i$. But $C(A)$ is compact, hence $\lim P_i = P$ intersects $C(A)$ contrary to the assumption that $P$ is in $A$.

THEOREM 3.5. If $A$ is closed, so is $h_K(A)$.


Proof. Let $p$ be any limit point of $h_K(A)$. By definition there exists a distinct sequence $[p_i]$ of points of $h_K(A)$ converging to $p$. This implies that $[P_i] \xrightarrow[\le r]{} P$ where each $P_i$ is in $A$. By the compactness of $A$, this implies that $P$ is in $A$, and hence $p \in h_K(A)$. Thus $h_K(A)$ is closed.


THEOREM 3.6. If $p$ and $q$ are distinct, then there exist disjoint open sets $U_p$ and $U_q$, containing $p$ and $q$ respectively.

Proof. Since $p$ and $q$ are distinct, we can suppose that $Q \not\subset P$. Then there exists an $\epsilon > 0$ such that $\bar U_\epsilon(P)$ does not contain $Q$; hence $q \in$ the complement of $h_K(\bar U_\epsilon(P)) = U_q$. By Theorem 3.5 $h_K(\bar U_\epsilon(P))$ is closed; hence its complement $U_q$ is open. By Theorem 3.4 $h_K(U_\epsilon(P)) = U_p$ is open and it contains $p$. Clearly $U_p$ and $U_q$ are disjoint and therefore fulfill the requirements of the theorem.


THEOREM 3.7. If "neighborhood" is interpreted to mean "open set," then $K$ is a Hausdorff space.

Proof. The proof follows immediately from Theorems 3.2, 3.3 and 3.6.


4. Regularity. 


THEOREM 4.1. For any point $p$ and $\epsilon > 0$, the subset of $K$ consisting of all points of $K$ corresponding to sets of $M$ whose $\epsilon$-neighborhoods contain $P$ is open.

Proof. Suppose that the theorem is false for some point $p$ and $\epsilon > 0$. Then the set described in the theorem contains a limit point $q$ of its complement. That is, $U_\epsilon(Q)$ contains $P$, but there exists a sequence $[Q_i] \xrightarrow[\le r]{} Q$ such that $P$ is not contained in the $\epsilon$-neighborhood of $Q_i$ for each $i$. $P$ and $Q$ are compact; hence there exists a farthest point $x$ of $P$ from $Q$ and $d = \rho(x, Q) < \epsilon$. This implies that $U_{(\epsilon+d)/2}(Q)$ contains $P$. But the convergence of $[Q_i]$ to $Q$ implies that there is a number $N$ such that $U_{(\epsilon-d)/2}(Q_N)$ contains $Q$, and hence $U_{(\epsilon-d)/2+(\epsilon+d)/2}(Q_N)$ contains $P$, or $U_\epsilon(Q_N)$ contains $P$, which contradicts the above assumption on $Q_N$ and $P$.
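The final containment is a triangle-inequality estimate. A sketch, assuming $\rho(y, S)$ denotes the distance from a point to a set and writing $y$, $q$, $q_N$ for auxiliary points that are not named in the original:

```latex
\begin{align*}
y \in P &\implies \rho(y, Q) \le d < \tfrac{\epsilon + d}{2}
  \implies \exists\, q \in Q:\ \rho(y, q) < \tfrac{\epsilon + d}{2}, \\
Q \subset U_{(\epsilon - d)/2}(Q_N) &\implies \exists\, q_N \in Q_N:\ \rho(q, q_N) < \tfrac{\epsilon - d}{2}, \\
\rho(y, q_N) &\le \rho(y, q) + \rho(q, q_N)
  < \tfrac{\epsilon + d}{2} + \tfrac{\epsilon - d}{2} = \epsilon .
\end{align*}
```

The first line uses $d < \epsilon$, which is exactly what makes $(\epsilon + d)/2$ exceed $d$.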


DEFINITION 4.1. If $P \subset M$ and $\epsilon > 0$, by $A_{\epsilon,p}$ we shall mean all points of $K$ corresponding to sets $Q$ in $M$ such that $U_\epsilon(Q)$ contains $P$, $U_\epsilon(P)$ contains $Q$.
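The two containments in this definition can be tested directly when the sets are finite. The sketch below is an illustration only: it assumes $M$ is the Euclidean plane and uses finite point sets, and the function names `in_neighborhood` and `in_A` are hypothetical.

```python
import math

def in_neighborhood(S, T, eps):
    """True if every point of S lies within distance < eps of some point of T."""
    return all(any(math.dist(s, t) < eps for t in T) for s in S)

def in_A(Q, P, eps):
    """Test modelled on Definition 4.1: U_eps(Q) contains P and U_eps(P) contains Q."""
    return in_neighborhood(P, Q, eps) and in_neighborhood(Q, P, eps)

P = [(0.0, 0.0), (1.0, 0.0)]
Q_near = [(0.0, 0.1), (1.0, -0.1)]   # close to P in both directions
Q_far = [(0.0, 0.1)]                 # lies near P, but misses the point (1, 0) of P

near_ok = in_A(Q_near, P, 0.2)
far_ok = in_A(Q_far, P, 0.2)
one_sided = in_neighborhood(Q_far, P, 0.2)
```

The set `Q_far` satisfies only one of the two containments, which shows why the definition asks for both.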


COROLLARY 4.11. For every point $p$ and $\epsilon > 0$, $A_{\epsilon,p}$ is open.

Proof. By Theorem 3.4 $h_K(U_\epsilon(P))$ is open. Also the set $O$ defined in Theorem 4.1 is open. The set $A_{\epsilon,p} = O \cdot h_K(U_\epsilon(P))$, and is open by Theorem 3.3.


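DEFINITION 4.2. The set of points $p$ of $K$ with the property that any complete $s$-dimensional cycle $\gamma^s$ in $P$ with diameter less than $\delta$ is homologous to zero in a subset of $P$ of diameter less than $\epsilon$, will be denoted by $K^s_{\epsilon,\delta}$.

The sets $K^s_{\bar\epsilon,\delta}$, $K^s_{\epsilon,\bar\delta}$, $K^s_{\bar\epsilon,\bar\delta}$ will be defined similarly where $\bar\epsilon$ means that $\le \epsilon$ replaces $< \epsilon$, etc.

DEFINITION 4.3. $K_{\epsilon,\delta} = \prod_{s \le r} K^s_{\epsilon,\delta}$, etc.

THEOREM 4.2. $K^s_{\bar e,d}$ is closed for all $d > 0$, $e > 0$, and $s \le r$.

We first prove two lemmas.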


LEMMA 4.3. If $[A_i]$ is a sequence of closed sets converging to $A$, and $e > 0$, $d > 0$ are numbers such that for every $\epsilon > 0$ there exists a $\delta > 0$ and an $N$ such that if $n \ge N$, any $s$-dimensional $\delta$-cycle in $A_n$ of diameter $< d$ is $\epsilon$-homologous to 0 in a subset of $A_n$ of diameter $\le e$, then any complete $s$-dimensional cycle in $A$ of diameter $< d$ is homologous to 0 in a subset of $A$ of diameter $\le e$.⁹

Proof. Let $\gamma^s = (C_1, C_2, \cdots)$ be any complete $s$-dimensional cycle in $A$ of diameter $< d$, and let $d'$ be any number such that $\delta(\gamma^s) < d' < d$. We have to show that for any $\epsilon > 0$, there exists an integer $M$ such that for $k > M$, $C_k \sim_\epsilon 0$ in a subset of $A$ of diameter $\le e$. By hypothesis, given $\epsilon/4$, there exist positive numbers $\delta < \epsilon$ and $N$ such that for any $n > N$, any $s$-dimensional $\delta$-cycle in $A_n$ of diameter $< d$ is $\sim_{\epsilon/4} 0$ in a subset of $A_n$ of diameter $\le e$. Let us take $M$ such that for $k > M$, $C_k$ is a $\delta/3$-cycle; and with $k$ fixed and $> M$,


® This is a modification of Lemma (1.1) of G. T. Whyburn, see footnote °. 


let $C_k = (x_0, x_1, \cdots, x_g)$, where the $x_i$ are the vertices of $C_k$. Take an integer $J$ such that for $i > J$, we have $A_i \subset U_{\delta'}(A)$ and $A \subset U_{\delta'}(A_i)$ where $\delta' = \min(\delta/3,\ [d - d']/2)$. Take a fixed $j > J + N$ and $\delta'$-project $C_k$ into $C^*_j = (y_0, y_1, \cdots, y_g)$, where $y_t$ is the projection of $x_t$.

Now since $\rho(y_t, y_m) \le \rho(y_t, x_t) + \rho(x_t, x_m) + \rho(x_m, y_m) < \delta/3 + \delta/3 + \delta/3 = \delta$, it follows that $C^*_j$ is a $\delta$-cycle. Also since $\rho(y_t, y_m) < \delta' + d' + \delta' \le (d - d')/2 + d' + (d - d')/2 = d$, then $C^*_j$ is of diameter $< d$. Thus by hypothesis there exists an $\epsilon/4$-complex $Z^{s+1} = (z_0, z_1, \cdots, z_n)$ in $A_j$ of diameter $\le e$ bounded by $C^*_j$. Project $Z^{s+1}$ into a complex $K^{s+1}$ in $A$ by a $\delta'$-projection. Then clearly $K^{s+1}$ is an $(\epsilon/4 + 2\delta' \le \epsilon/4 + 2\delta/3 < \epsilon/4 + 2\epsilon/3 = 11\epsilon/12)$-complex in $A$ of diameter $\le e + 2(d - d')/2 = e + d - d'$. Thus $C_k \sim_{11\epsilon/12} 0$ in a subset of $A$ of diameter $\le e + d - d'$, for any $k > M$. We note that $d'$ was an arbitrary number $(\delta(\gamma^s) < d' < d)$ and that the choice of $M$ did not depend on our choice of $d'$; therefore we can assert the above statement for $k > M$ where $d - d'$ is arbitrarily small. We shall show that this implies $C_k \sim_\epsilon 0$ in a subset of $A$ of diameter $\le e$. To this end pick a sequence $[d'_i]$ such that $[d - d'_i] \to 0$. There exist for each $i$ $11\epsilon/12$-complexes $K_i^{s+1}$ such that $\partial K_i^{s+1} = C_k$ and $\delta(K_i^{s+1}) \le d - d'_i + e$. Suppose that the sequence $[d'_i]$ was so chosen that the point sets making up the complexes $[K_i^{s+1}]$ converge to a point set $K$, which must clearly contain $C_k$ and have diameter $\le e$. Pick an integer $i$ such that $U_{\epsilon/24}(K)$ contains $K_i$. Now let $K^{s+1}$ be the $\epsilon/24$-projection of $K_i^{s+1}$ into $K$ with points of $C_k$ fixed. Clearly $\partial K^{s+1} = C_k$ and $\delta(K^{s+1}) \le e$; also $K^{s+1}$ is an $(11\epsilon/12 + \epsilon/24 + \epsilon/24 = \epsilon)$-complex. That is, $C_k \sim_\epsilon 0$ in a subset of diameter $\le e$, and our lemma is proved.
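The mesh bookkeeping in this proof reduces to two elementary sums. A sketch, assuming $\delta' \le \delta/3$ and $\delta < \epsilon$ as chosen in the proof (the symbol $\delta'$ for the projection distance is the notation assumed here):

```latex
\begin{align*}
\tfrac{\epsilon}{4} + 2\delta' &\le \tfrac{\epsilon}{4} + \tfrac{2\delta}{3}
  < \tfrac{\epsilon}{4} + \tfrac{2\epsilon}{3} = \tfrac{11\epsilon}{12}, \\
\tfrac{11\epsilon}{12} + \tfrac{\epsilon}{24} + \tfrac{\epsilon}{24} &= \epsilon .
\end{align*}
```

The first line bounds the mesh after projecting into $A$; the second accounts for the final $\epsilon/24$-projection, which moves each endpoint of an edge by less than $\epsilon/24$.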


LEMMA 4.4. If $[A_i]$ is a sequence of closed sets converging $(\le r)$-regularly to $A$, and $e > 0$, $d > 0$ are numbers such that every complete $s$-dimensional cycle $(0 \le s \le r)$ of diameter $< d$ is $\sim 0$ in a subset of $A_i$ of diameter $\le e$ for $i > N'$, then for any numbers $0 < d' < d$, $\epsilon > 0$, $\eta > 0$ there exist numbers $\delta > 0$ and $N$ such that if $n > N$ and $C^s$ is any $\delta$-cycle in $A_n$ of diameter $< d'$ then $C^s \sim_\epsilon 0$ in a subset of $A_n$ of diameter $< e + \eta$.


Proof.¹¹ Let $\epsilon > 0$, $\eta > 0$, $d' > 0$ be given and let $\delta_s = \min(\epsilon,\ \eta,\ d,\ [d - d']/2)$. By virtue of the $(s-1)$-regular convergence, there exist positive numbers $\delta_{s-1}$ and $N_{s-1}$ such that if $n > N_{s-1}$, then any $\gamma^{s-1}$ in $A_n$ of diameter $< 3\delta_{s-1}$ is $\sim 0$ in a subset of $A_n$ of diameter $< \delta_s$. Likewise, by


19 See footnote ° 
This proof is a modification of the proof used in Theorem 1 of the paper 


referred to in footnote °. 


virtue of the $(s-2)$-regular convergence, using $\delta_{s-1}$, we have positive numbers $\delta_{s-2}$ and $N_{s-2}$ such that if $n > N_{s-2}$, then any $\gamma^{s-2}$ in $A_n$ of diameter $< 3\delta_{s-2}$ is $\sim 0$ in a subset of $A_n$ of diameter $< \delta_{s-1}$. Continuing as in Lemmas 2.1 and 2.3 by the same argument, we reach numbers $\delta_0$ and $N_0$ such that if $n > N_0$ then any $\gamma^0$ in $A_n$ of diameter $< 3\delta_0$ is $\sim 0$ in a subset of $A_n$ of diameter $< \delta_1$.

We may suppose $\delta_s > \delta_{s-1} > \cdots > \delta_0$. Let $\delta = \delta_0$ and $N = N' + \sum_{i=0}^{s} N_i$. We shall show that these are the numbers satisfying the requirements of the theorem.

To this end let $n$ be any integer $> N$ and let $C^s$ be any $s$-dimensional $\delta$-cycle in $A_n$ of diameter $< d'$. Again as in Lemmas 2.1 and 2.3, it is clear that $C^s$ has a Vietoris cycle realization $Z = (z_1, z_2, \cdots)$ in $A_n$ of diameter $< d' + 2(d - d')/2 = d$. Also as before $C^s \sim_{\delta_s} z_k$ in a subset of $A_n$ of diameter $< d + 2\delta_s$ for each $k$. Whence $C^s \sim_\epsilon z_k$ for all $k$ since $\delta_s \le \epsilon$.

Now since $\delta(Z) < d$ and since $n > N$, it follows by hypothesis that $Z \sim 0$ in a subset of $A_n$ of diameter $\le e$. The use of the prism construction in the $\delta_s$-homology between $C^s$ and $z_k$ uses no new vertices; hence every point of the complex bounded by $C^s + z_k$ is within a distance $\delta_s$ of some point of $z_k$. Also every point of the $\epsilon$-complex bounded by $z_k$ is within a distance $e$ of any point in $z_k$; therefore the sum of these two complexes is an $\epsilon$-complex bounded by $C^s$ with diameter $< e + \delta_s \le e + \eta$. That is, $C^s \sim_\epsilon 0$ in a subset of $A_n$ of diameter $< e + \eta$, which was to be proved.

We now proceed to the proof of the theorem. To this end consider a sequence $[a_i]$ in $K^s_{\bar e,d}$ converging to $a$. This implies that $[A_i] \xrightarrow[\le r]{} A$, and that any $\gamma^s$ in $A_i$ with diameter $< d$ is homologous to 0 in a subset of $A_i$ of diameter $\le e$. If $\eta > 0$ and $d'$ are numbers with $0 < d' < d$, then by Lemma 4.4 $e + \eta$ and $d'$ are numbers such that for any $\epsilon > 0$ there exist numbers $\delta > 0$ and $N$ such that if $n > N$ and $C^s$ is a $\delta$-cycle in $A_n$ of diameter $< d'$ then $C^s \sim_\epsilon 0$ in a subset of $A_n$ of diameter $< e + \eta$ and hence $\le e + \eta$. Now by Lemma 4.3, any complete $s$-dimensional cycle in $A$ of diameter $< d'$ is $\sim 0$ in a subset of $A$ of diameter $\le e + \eta$. Since $d'$ and $\eta$ are arbitrary, we can conclude that every complete cycle in $A$ of diameter $< d$ is $\sim 0$ in a subset of $A$ of diameter $\le e + \eta$ for all $\eta > 0$. Thus if $[\eta_j]$ is a sequence converging to 0, then there exist complexes $[D_j^{s+1}]$ such that $\partial D_j^{s+1} = C^s$ and $\delta(D_j^{s+1}) \le e + \eta_j$ where $D_j^{s+1} \subset A$. We can suppose the $\eta_j$'s so chosen that the point sets $[D_j^{s+1}]$ converge to a limit $L$. Clearly $\delta(L) \le e$ and $L$ contains the point set $C^s$. We shall exhibit a complete $(s+1)$-chain in $L$ bounded by $C^s$. To this end consider a strictly monotone decreasing sequence $[\alpha_j] \to 0$. We can pick a subsequence of the point sets $[D_j^{s+1}]$, suppose it to be the whole sequence, such that $U_{\alpha_j/3}(D_j^{s+1})$ contains $L$ and $U_{\alpha_j/3}(L)$ contains $D_j^{s+1}$ for each $j$.

Corresponding to $\alpha_j/3$ there is a number $N_j$ such that if $D_j^{s+1} = (D_{1j}^{s+1}, D_{2j}^{s+1}, \cdots)$, then for $i > N_j$, $D_{ij}^{s+1}$ is an $\alpha_j/3$-complex. Project each of the $D_{ij}^{s+1}$ for $i > N_j$ by means of an $\alpha_j/3$ projection¹² into a complex $E_{ij}^{s+1}$ of $L$ keeping points of $C_i^s$ fixed. Clearly, for $i > N_j$, $\delta(E_{ij}^{s+1}) \le e$ and $E_{ij}^{s+1}$ is an $\alpha_j$-complex for all $i$ such that $\partial E_{ij}^{s+1} = C_i^s$. We do this for each $j$ and define a complete complex $E^{s+1}$ as follows:


2,1 ? Ni +1,1? Ni+N2-1, 1? 


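Now $\partial E_{i j_i}^{s+1} = C_i^s$ for each $i$ and $\delta(E_{i j_i}^{s+1}) \le e$. Finally consider any number $\alpha > 0$; there exists an $\alpha_j < \alpha$ and an $N$ such that for any $i > N$, $E_{i j_i}^{s+1}$ is an $\alpha_j$-complex. This implies that $E_{i j_i}^{s+1}$ is an $\alpha$-complex; hence $C^s \sim_\alpha 0$ for all $\alpha > 0$, or $C^s \sim 0$ in a subset of $A$ of diameter $\le e$. This tells us that $a \in K^s_{\bar e,d}$ which is therefore closed.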
COROLLARY 4.21. $K_{\bar e,d}$ is closed for each $e > 0$ and $d > 0$.

Proof. By definition $K_{\bar e,d} = \prod_{s \le r} K^s_{\bar e,d}$, where $K^s_{\bar e,d}$ is closed by Theorem 4.2. Now $K_{\bar e,d}$ is closed for, by Theorem 3.7, $K$ is a Hausdorff space in which it is always true that the product of any number of closed sets is closed.


COROLLARY 4.22. $\prod_{\eta>0} K^s_{e+\eta,d} = K^s_{\bar e,d}$.

Proof. This follows immediately from a part of the proof. That is, we showed that if every complete cycle in $A$ of diameter $< d$ is $\sim 0$ in a subset of $A$ of diameter $< e + \eta$ for all $\eta > 0$, then every complete cycle of diameter $< d$ bounds in a subset of diameter $\le e$, or in other words $\prod_{\eta>0} K^s_{e+\eta,d} \subset K^s_{\bar e,d}$. The inclusion in the other direction follows immediately from the definitions of the sets involved, hence the equality.


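COROLLARY 4.23. $\prod_{\eta>0} K_{e+\eta,d} = K_{\bar e,d}$.

Proof. $\prod_{\eta>0} K_{e+\eta,d} = \prod_{\eta>0} \prod_{s \le r} K^s_{e+\eta,d} = \prod_{s \le r} \prod_{\eta>0} K^s_{e+\eta,d} = \prod_{s \le r} K^s_{\bar e,d} = K_{\bar e,d}$.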


THEOREM 4.5. $K^s_{e-\sigma,\bar d}$ is interior to $K^s_{\bar e,d}$.

Proof. Suppose that the theorem were false; then there would be a point $p$ in $K^s_{e-\sigma,\bar d}$, but not interior to $K^s_{\bar e,d}$. That is, there exists a sequence $[p_i]$ of

12 See footnote °. 


points in $C(K^s_{\bar e,d})$ converging to $p$, or $[P_i] \xrightarrow[\le r]{} P$, where each $P_i$ is $lc^r$. Now $e - \sigma$ and $d$ are numbers such that every $\gamma^s$ of $P$ with diameter $\le d$ is homologous to 0 in a subset of $P$ of diameter $< e - \sigma$ by the definition of $K^s_{e-\sigma,\bar d}$. But by Lemma 2.3 where $\epsilon = e - \sigma$, $s = r$, $\delta_r = d$, and $\sigma = \sigma$ we know that there exists a number $N$ such that if $i > N$, then every $\gamma^s$ in $P_i$ with diameter $\le d$ is homologous to 0 in a subset of $P_i$ of diameter $< e - \sigma + \sigma = e$. This says that for $i > N$, $p_i \in K^s_{e,\bar d} \subset K^s_{\bar e,d}$ which is contrary to our hypothesis; hence the theorem is true.

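COROLLARY 4.51. $K_{e-\sigma,\bar d}$ is interior to $K_{\bar e,d}$.

Proof. $K_{e-\sigma,\bar d} = \prod_{s \le r} K^s_{e-\sigma,\bar d} \subset \prod_{s \le r}$ (interior of $K^s_{\bar e,d}$) by Theorem 4.5. This in turn is contained in the interior of $\prod_{s \le r} K^s_{\bar e,d}$, since $K$ is a Hausdorff space, but this latter set is the interior of $K_{\bar e,d}$ which was to be proved.
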
THEOREM 4.6. K is regular. 


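Proof. Let $O_p$ be any open set containing a point $p$. Consider a strictly monotone decreasing sequence $[\epsilon_i] \to 0$. Since $P$ is $lc^r$, for each $\epsilon_i$ there is a $d_i < \epsilon_i$ such that if $\gamma^s$ is a complete cycle in $P$ with diameter $\le d_i$ where $s$ is any number $\le r$, then $\gamma^s \sim 0$ in a subset of $P$ of diameter $< \epsilon_i/2$. Thus for all $i$, $p \in K_{\epsilon_i/2,\bar d_i}$. Let $A_n = \prod_{i=1}^{n} K_{\bar\epsilon_i,d_i} \cdot \bar A_{\epsilon_n,p}$. By Corollary 4.21 each $K_{\bar\epsilon_i,d_i}$ is closed. Now there exists an $N$ such that $A_N \subset O_p$; for if not, then there exists a point $p_n$ in the set $A_n \cdot C(O_p)$ for each $n$. Since $\epsilon_{n-1} > \epsilon_n$, we have $A_{\epsilon_n,p} \subset A_{\epsilon_{n-1},p}$ for each $n$; hence $[P_n] \to P$. Also for any $e > 0$ there is an $\epsilon_m < e$, and we shall let $d = d_m$. Now consider $P_k$ for $k \ge m$, and a $\gamma^s$ in $P_k$ for some $s \le r$ such that $\delta(\gamma^s) < d$. Since $k \ge m$, $p_k \in K_{\bar\epsilon_m,d_m}$ and $\gamma^s \sim 0$ in a subset of $P_k$ of diameter $\le \epsilon_m < e$. This says that $[P_n] \xrightarrow[\le r]{} P$ or $[p_n] \to p$, but this is impossible as each $p_n$ was in $C(O_p)$ which is closed. Now by Corollary 4.51 $p \in K_{\epsilon_i - \epsilon_i/2,\bar d_i} \subset$ interior of $K_{\bar\epsilon_i,d_i}$ for all $i$. It follows that $p \in A_{\epsilon_N,p} \cdot \prod_{i=1}^{N}$ interior $K_{\bar\epsilon_i,d_i} \subset A_N \subset O_p$. If we let $U_p = \prod_{i=1}^{N}$ interior $K_{\bar\epsilon_i,d_i} \cdot A_{\epsilon_N,p}$ which is open, then $\bar U_p \subset \bar A_N = A_N \subset O_p$ and $K$ is regular.
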
COROLLARY 4.61. Corresponding to any point $p$, there exists a countable monotone decreasing sequence of open sets $[O_i]$ whose product is $p$, and such that if $U$ is any open set containing $p$, then there is a number $N$ such that $O_N \subset U$.



Proof. Let $O_k = \prod_{i=1}^{k}$ interior $K_{\bar\epsilon_i,d_i} \cdot A_{\epsilon_k,p}$ as in the proof of Theorem 4.6. Then $\prod_{i=1}^{\infty} O_i = p$, for if not there exists a point $q$ different from $p$ in $\prod_{i=1}^{\infty} O_i$. But by Theorem 3.6 there exists an open set $U_p$ containing $p$ such that $q \notin U_p$, and by the proof of Theorem 4.6 there exists a number $N$ such that $O_N \subset U_p$. Thus $\prod_{i=1}^{\infty} O_i \subset O_N \subset \bar O_N \subset U_p$, contrary to the assumption that $q$ is in $\prod_{i=1}^{\infty} O_i$. Also in the proof of Theorem 4.6 the second half of the theorem is proved.


5. The metrization and separability of K. 


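THEOREM 5.1. Corresponding to any set $R_{\bar\epsilon,\delta} = \sum_{\eta>0} K_{\overline{\epsilon-\eta},\,\delta+\eta}$ there is a continuous function $f$ of $K$ into the interval $0 \le x \le \epsilon$ such that $f(R_{\bar\epsilon,\delta}) > 0$ and $f(C(R_{\bar\epsilon,\delta})) = 0$.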


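Proof. We define $f(p)$ as follows: If $p \in R_{\bar\epsilon,\delta}$, $f(p) = $ l.u.b. $\eta$ such that $p \in K_{\overline{\epsilon-\eta},\,\delta+\eta}$. If $p \in C(R_{\bar\epsilon,\delta})$, $f(p) = 0$. Clearly $0 \le f(p) \le \epsilon$. Now $f$ is continuous, for consider $[p_i] \to p$. Let $\eta_1$ be any number $> f(p)$; then by definition of $f(p)$ we have $p \notin K_{\overline{\epsilon-\eta_1},\,\delta+\eta_1}$. By Corollary 4.21, $K_{\overline{\epsilon-\eta_1},\,\delta+\eta_1}$ is closed and hence $C(K_{\overline{\epsilon-\eta_1},\,\delta+\eta_1})$, which contains $p$, is open. This implies that almost all $[p_i]$ are in $C(K_{\overline{\epsilon-\eta_1},\,\delta+\eta_1})$ and hence $f(p_i) \le \eta_1$ for almost all $i$. This completes the proof of the continuity if $p \in C(R_{\bar\epsilon,\delta})$. If, however, $p \in R_{\bar\epsilon,\delta}$, let $\eta_2 > 0$ be any number $< f(p)$ and pick $\eta_3$ such that $\eta_2 < \eta_3 < f(p)$. Now $p \in K_{\overline{\epsilon-\eta_3},\,\delta+\eta_3}$ by the definition of $f(p)$. We can now show that almost all $[p_i] \subset K_{\overline{\epsilon-\eta_2},\,\delta+\eta_2}$. To this end let $\sigma = (\eta_3 - \eta_2)/2$. Now $\eta_2 < \eta_3$ and $\delta + \eta_2 < \delta + \eta_3$, so that $p \in K_{\overline{\epsilon-\eta_3},\,\delta+\eta_3} \subset K_{\epsilon-(\eta_3-\sigma),\,\overline{\delta+\eta_2}}$. By Lemma 2.3 almost all $[p_i] \subset K_{\epsilon-(\eta_3-2\sigma),\,\overline{\delta+\eta_2}} = K_{\epsilon-\eta_2,\,\overline{\delta+\eta_2}} \subset K_{\overline{\epsilon-\eta_2},\,\delta+\eta_2}$; therefore $f(p_i) \ge \eta_2$ for almost all $i$. Now $\eta_2 < f(p) < \eta_1$ where $\eta_2$ and $\eta_1$ were arbitrary and we have just shown that for almost all $i$, $\eta_2 \le f(p_i) \le \eta_1$; hence $[f(p_i)]$ converges to $f(p)$ which concludes the continuity proof.

Furthermore if $p \in C(R_{\bar\epsilon,\delta})$, $f(p) = 0$ by definition. Finally if $p \in R_{\bar\epsilon,\delta}$, then $p \in K_{\overline{\epsilon-\eta},\,\delta+\eta}$ for some $\eta > 0$ and $f(p) \ge \eta > 0$. This concludes the proof of the theorem.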


THEOREM 5.2. $H$ (and hence $K$) is a subset of Hilbert space if we understand that ordinary convergence takes the place of regular convergence in the definition of a limit point (that is, $[p_i] \to p$ if and only if $[P_i] \to P$).




Proof. Although this theorem is by no means new, its proof follows 
almost immediately from our previous work. We have only to note that in 
most of our theorems it was necessary to prove convergence first and then 
regular convergence. Thus if we leave off the part of our proofs about regular 
convergence, we get the same theorems for H with this new limit notion, 
that we proved before for K. In fact we can say that H is a regular Hausdorff 
space, for each of the theorems involved in proving this are of the character 
described above. It remains to show the existence of a fundamental sequence 
of regions in $H$. To do this we note that since $M$ is compact and metric, it has a fundamental sequence of regions $(R_1, R_2, \cdots)$. Define $p_{i_1 i_2 \cdots i_n}$ to be the point of $H$ corresponding to $R_{i_1} + R_{i_2} + \cdots + R_{i_n}$; then the collection $[p_{i_1 \cdots i_n}]$, where $n$ and each $i_j$ range over all integers, will be shown to be everywhere dense in $H$. If $p$ is a point of $H$ then $P$ is compact and can be covered by a monotone decreasing sequence of sets $R_{i_{11}} + R_{i_{12}} + \cdots + R_{i_{1n_1}}$, $R_{i_{21}} + R_{i_{22}} + \cdots + R_{i_{2n_2}}$, $\cdots$ whose product is $P$. Thus the sequence $[p_{i_{j1} \cdots i_{jn_j}}] \to p$ as $j \to \infty$. Now we can associate with each $p_{i_1 i_2 \cdots i_n}$ the sequence $[A_{1/k,\,p_{i_1 i_2 \cdots i_n}}]$. This gives a countable collection of sets which serve as a fundamental sequence. For consider a point $p$ and any open set $U$ containing $p$. There will be a set $A_{1/k,p} \subset U$ and a point $p_{i_1 \cdots i_n} \in A_{1/2k,p}$. Now $A_{1/2k,\,p_{i_1 \cdots i_n}}$ contains $p$ and $\subset U$, which is the defining property of a fundamental sequence. It is a well known result that we can set up the Hilbert metric in such a space, and since $K$ is a subset of $H$, it too is homeomorphic with a subset of the Hilbert space.


THEOREM 5.3. K can be metrized by means of the Hilbert metric in such 
a way that the regular convergence definition of a limit point is preserved. 


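Proof. Let us consider the sequence $[1/2^{i/2}]$, and let $\delta_i = \epsilon_i = 1/2^{i/2}$ $(i = 1, 2, \cdots)$. In Theorem 5.1 denote by $f_{ij}$ the function corresponding to $R_{\bar\epsilon_i,\delta_j}$. Finally let $f_{0j}(p) = 2^{j/2} \cdot$ (the $j$-th coördinate of $p$ in the Hilbert metric established in Theorem 5.2). Then the distance under this metric between two points $p$ and $q$ of $K$ would be

$$\rho_1(p,q) = \sqrt{\sum_{j=1}^{\infty} \frac{(f_{0j}(p) - f_{0j}(q))^2}{2^j}},$$

where $\sum_{j=1}^{\infty} f_{0j}^2(p)/2^j$ exists for all $p$ in $K$.

The Hilbert coördinates of this theorem will be $[f_{ij}(p)/2^{j/2}]$ where $(0 \le i \le j)$, $(j = 1, \cdots)$, and we shall define

$$\rho(p,q) = \sqrt{\sum_{j=1}^{\infty} \sum_{i=0}^{j} \frac{(f_{ij}(p) - f_{ij}(q))^2}{2^j}}.$$

We may think of the coördinates as a single sequence where $f_{ij}(p)/2^{j/2}$ precedes $f_{kl}(p)/2^{l/2}$ if $j < l$ or if $j = l$ and $i < k$. These are possible Hilbert coördinates for

$$\sum_{j=1}^{\infty} \sum_{i=0}^{j} \frac{f_{ij}^2(p)}{2^j} = \sum_{j=1}^{\infty} \sum_{i=1}^{j} \frac{f_{ij}^2(p)}{2^j} + \sum_{j=1}^{\infty} \frac{f_{0j}^2(p)}{2^j}.$$

We already know that the second part converges so we need only consider the first. We recall that $f_{ij}(p) \le \epsilon_i$; hence

$$\sum_{j=1}^{\infty} \sum_{i=1}^{j} \frac{f_{ij}^2(p)}{2^j} \le \sum_{j=1}^{\infty} \frac{1}{2^j} \sum_{i=1}^{\infty} \epsilon_i^2 = \sum_{j=1}^{\infty} \frac{1}{2^j} = 1.$$

Thus both the first and second, and hence the entire series converges.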
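The shape of such a distance can be sketched numerically. The following is a minimal illustration, not the author's construction: it assumes a single index $j$, coördinate functions bounded by 1, and a finite truncation of the series; the name `hilbert_distance` is hypothetical. The weights $1/2^j$ are what make the series converge for bounded coördinates.

```python
import math

def hilbert_distance(coords_p, coords_q):
    """Weighted l2 distance sqrt(sum_j (a_j - b_j)^2 / 2^j), with j starting at 1."""
    return math.sqrt(sum((a - b) ** 2 / 2 ** j
                         for j, (a, b) in enumerate(zip(coords_p, coords_q), start=1)))

# Hypothetical coordinate vectors with entries in [0, 1], truncated at 40 terms.
p = [1.0] * 40
q = [0.0] * 40
r = [0.5] * 40

d_pq = hilbert_distance(p, q)
d_pr = hilbert_distance(p, r)
d_rq = hilbert_distance(r, q)
```

Since each squared difference is at most 1, the sum under the root is at most $\sum_j 1/2^j = 1$, so every such distance stays below 1 however many terms are kept.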
We now proceed to prove that $\rho(p,q)$ has the several properties of a metric. For simplicity let

$$\rho_2(p,q) = \sqrt{\sum_{j=1}^{\infty} \sum_{i=1}^{j} \frac{(f_{ij}(p) - f_{ij}(q))^2}{2^j}}.$$

1) That $\rho(p,q)$ is defined and $\ge 0$ for all $p$ and $q$ follows from the above discussion and the fact that all terms are positive.

2) Clearly $\rho(p,q) = \rho(q,p)$.

3) $\rho(p,q) = 0$ if and only if $p = q$.
Clearly $\rho(p,q) = 0$ if $p = q$. Conversely, if $\rho(p,q) = 0$ then $f_{0j}(p) = f_{0j}(q)$ for all $j$, hence $\rho_1(p,q) = 0$ and $p = q$ since $\rho_1$ is a metric.

4) The triangle inequality follows in the usual way for this type of metric.

5) The point $p$ is a limit point of the set $E$ if and only if for any $\epsilon > 0$, there exists a point $q$ of $E$ such that $\rho(p,q) < \epsilon$.

First let $p$ be a limit point of a set $E$; then there exists a sequence of distinct points $[p_k]$ of $E$ converging to $p$. This means that $[P_k] \xrightarrow[\le r]{} P$. Since $[P_k] \to P$, there exists a number $N_1$ such that for $k > N_1$, $\rho_1(p, p_k) < \epsilon/2$.


Also there is a number such that > 1/25 < ¢/4. Let 1/2) == K. 


f= ASS 
By the continuity of fi; (t+=0) we can find an Ns; such that for k > Ns, 
—fis(pe)| << /4K (t=—1,- - +573; g=l,- -,N2). Thus for 
No 
k > No + Na, po(p, pr) < (> i/24) +¢6/4 &(/4K - K+ €/4 


=1 
¢/4=c/2. Finally p(p, S pi(p, pr) + p2(ps Pe)- Therefore for 

k>N,+ Ns, p(p, pr) < 6/2 + €/2 = 
Conversely, suppose that for each « > 0 om is a point g in £ different 
from p such that p(p,q) <«, but that p is not a limit point of £. This 
P) 
which is open. Now for each «&,; there exists a number 8 < ej4; by virtue of 
the local y*-connectedness of P for s r such that pe Kz,,,6 Let» =(V2—1)8 
and V2 =d—y. Then pis in Finally 


implies that p ¢ 2’ and hence not in /& — p which is closed. Thus pe C(£ 


let 8;, be any member of our original sequence [8;] such that 8;, << 8’. Clearly 
we also have pe K7,5,,4n C Ré,,5,,- Also it follows readily from the definitions 




that Rz,,0,,C Ké,.s,, Now in proving Theorem 4.6 we showed that corre- 
sponding to any open set and a point in it there is a number N such that 


N N 
p is in JI (Ra.s,,) C IL C p)]. The defini- 
4=1 i=1 


tion of Aps,, has already been given (Definition 4.1), but if we think of it 
here as the set of points g such that p(q, p) < 8jy, it is clear that the proof 
follows just as before. Now by Theorem 5.1 fij;,(p) >0 for i=—1,:--,N 
since pe Rz,,5,,; hence we can let L = the smallest of the numbers fi;,(p) for 
i=1,---,N. By hypothesis there exists a point q different from p and e £ 
such that p(p,q) < min. (djy, Since pi(p,g) S p(p,q) < din, we 
have geAps,, Where we use the nc sind definition of this set. Also 


| fis (p) —fas(q) = (esp) — Fig) = p2(p.7) Sp(p,q) < L 


for any i=1,---+,N. Hence | fi;(p) — fis(q)| < < L and 
fis,(¢) > fis,(p) —L = 0 for i—1,: - By Theorem 5.1 this implies 


that qe Re,,s,, for t—=1,---,N, and Re, CI Ka, Ap, 


CC(£—>p). This however contradicts the fact that ‘and and 
p must be a limit point of #. This concludes the proof of the theorem. 


COROLLARY 5.31. K is separable.

Proof. This follows since, by Theorem 5.3, K (with the regular convergence limit notion) is homeomorphic with a subset of Hilbert space which is separable.


COROLLARY 5.32. Contained in any compact metric space M is a countable collection of closed locally connected subsets such that a subsequence of the collection can be found corresponding to any closed locally connected subset of M which converges to it 0-regularly. Furthermore if M is locally connected then a subsequence can be found corresponding to any closed subset of M which converges to it in the ordinary sense.


Proof. The first part is merely a restatement of Corollary 5.31 where $r = 0$, for local $\gamma^0$-connectedness is exactly the same as local connectedness in compact spaces. The second part is also immediate, for if M is locally connected then the locally connected closed subsets are everywhere dense in the space of all closed subsets where the ordinary convergence notion is used. Thus if our collection has subsequences converging regularly to all closed locally connected subsets, it also has subsequences converging to any closed subset.




6. The connectivity of K. A large part of this section is dependent 
on some general theorems on Betti numbers which are now stated. 


THEOREM 6.1. The r-dimensional Betti number $p^r(M)$ of M is finite if M is $lc^r$.

Proof. This is a theorem of R. L. Wilder. For the proof see the Duke 
Mathematical Journal, vol. 1 (1935), p. 546. 


THEOREM 6.2. If the sequence of closed sets $[A_i] \to A$, and $p^s(A) = n$ for some $s \leq r$, then there exists a number N such that for $i > N$, $p^s(A_i) \geq n$.

THEOREM 6.3. If the sequence of closed sets $[A_i] \to A$, and for some number $s \leq r$, $p^s(A) = n$, then there exists an N such that for $i > N$, $p^s(A_i) \leq n$.

Both of the above two theorems are due to H. A. Arnold, but have not as 
yet been published. The following three corollaries follow immediately from 
these theorems. 


COROLLARY 6.31. If the sequence of closed sets $[A_i] \to A$ where $p^s(A) = n$ for some number $s \leq r$, then there exists a number N such that for $i > N$, $p^s(A_i) = n$.

COROLLARY 6.32. If the sequence of closed sets $[A_i] \to A$ where for almost all i, $p^s(A_i) = n$ for a number $s \leq r$, then $p^s(A) = n$.


CoroLiary 6.33. If the sequence of closed sets [Ai]>A where for 
almost all 1 p*(A;i) for a number s Sr, then p*(A) An. 


DEFINITION 6.1. We shall denote by $K_{n_0, n_1, \ldots, n_r}$ the set of all points p of K corresponding to subsets P of M such that $p^s(P) = n_s$ $(s = 0, \ldots, r)$.


THEOREM 6.4. $K_{n_0, n_1, \ldots, n_r}$ is both open and closed for all numbers $n_s \geq 0$.


Proof. If p is a limit point of the set in question, then there exists a sequence of points $[p_i]$ in $K_{n_0, n_1, \ldots, n_r}$ such that $[p_i] \to p$. This means that $p^s(P_i) = n_s$ for all i, which implies by Corollary 6.32 that $p^s(P) = n_s$ for each $s \leq r$; hence $p \in K_{n_0, n_1, \ldots, n_r}$. This proves the set is closed.

Next we show the set is open, for if it were not then a point p of it would exist together with a sequence $[p_i]$ of the complement converging to it. That is, $[P_i] \Rightarrow P$ where $p^s(P) = n_s$ for $s \leq r$. But by Corollary 6.31 $p^s(P_i) = n_s$ for almost all i and each $s \leq r$; hence $p_i \in K_{n_0, n_1, \ldots, n_r}$ for almost all i, contradicting the hypotheses on them. Thus $K_{n_0, n_1, \ldots, n_r}$ must also be open.




DEFINITION 6.2. Let $K_n^s = \sum_{n_i = 0,\ i \neq s}^{\infty} K_{n_0, \ldots, n_{s-1}, n, n_{s+1}, \ldots, n_r}$.


THEOREM 6.5. The set $K_n^s$ is both open and closed for all numbers $s \leq r$ and $n \geq 0$.

Proof. The proof is the same as that of Theorem 6.4 if we fix our attention on a particular s throughout the discussion.

COROLLARY 6.51. K will be connected if and only if M is a single point.

Proof. If K is connected, then $K = K_1^0$.¹³ Thus M cannot contain two points and M = a single point.
Conversely if M = a single point, it is clear that K is a single point which is connected.


COROLLARY 6.52. K will have a finite number of components if and only if M is finite.

Proof. If K has only a finite number of components, then since $K = \sum_{j=0}^{\infty} K_j^0$, we have $K_i^0 = 0$ for all i > some number N. That is, there exists no locally $\gamma^s$-connected subset with more than N components. In particular, M cannot contain more than N points.

Conversely, it is clear that if M is finite then so is K, and K has only a finite number of components.


THEOREM 6.6. If $M = M_1 + M_2 + \cdots + M_n$, where $M_i$ is open and closed for each i, then $K_1^0 = K_{1,1}^0 + K_{1,2}^0 + \cdots + K_{1,n}^0$ where each $K_{1,i}^0$ is both open and closed.

Proof. Define $K_{1,i}^0 = K_1^0 \cdot h_K(M_i)$. By Theorem 6.5 $K_1^0$ is both open and closed. By Theorems 3.4 and 3.5 $h_K(M_i)$ is both open and closed, and hence so is the product $K_{1,i}^0$. Now $K_1^0 \subset \sum_{i=1}^{n} K_{1,i}^0$, for if $p \in K_1^0$, then P is a locally $\gamma^r$-connected continuum and is contained in one set $M_i$. That is, $p \in K_{1,i}^0$ and $K_1^0 \subset \sum_{i=1}^{n} K_{1,i}^0$. Now clearly $\sum_{i=1}^{n} K_{1,i}^0 \subset K_1^0$ and $K_1^0 = \sum_{i=1}^{n} K_{1,i}^0$, which was to be proved.


13 We assume that a 0-dimensional cycle is any number of points in this discussion. This implies that $p^0(M)$ = the component number.




THEOREM 6.7. If $K_1^0$ is disconnected and has m components, then $K_n^0$ is disconnected, and has at least $\sum_{i=1}^{m} {}_mC_i \cdot {}_nS_i$ components, where ${}_mC_i$ is the number of combinations of m things i at a time and ${}_nS_i$ is the number of permutations of positive integers whose sum is n taken i at a time with repetitions allowed.


Proof. By definition every point of $K_n^0$ corresponds to a set with n components. Suppose these components to be picked from i of the m components of $K_1^0$ and that $j_1, j_2, \ldots, j_i$ are the numbers of components of the set which correspond to the i different components. Thus $j_1 + j_2 + \cdots + j_i = n$. Now clearly the i components can be chosen in ${}_mC_i$ ways and the n components of the set can be picked so that their correspondents in K come from the i components in ${}_nS_i$ ways. Therefore, there are ${}_mC_i \cdot {}_nS_i$ choices in all if i components are used, hence in all $\sum_{i=1}^{m} {}_mC_i \cdot {}_nS_i$ choices. Now consider a subset C of $K_n^0$ corresponding to all sets chosen in one of the above ways. Suppose $K_1^0 = \sum_{j=1}^{m} K_{1j}$, where $[K_{1j}]$ are the components of $K_1^0$. Now each point in C corresponds to a set with $j_1$ components corresponding to points in $K_{1k_1}$, $j_2$ components in $K_{1k_2}$, ..., $j_i$ components in $K_{1k_i}$. C is closed, for consider $[p_i] \subset C$ such that $[p_i] \to p$ or $[P_i] \Rightarrow P$. Now P has n components since $K_n^0$ is closed. Also a subsequence of $[P_i]$ can be picked such that there exist n component sequences converging $\leq r$-regularly to n components of P and such that each sequence is contained entirely in the same set $K_{1q}$. Since each $K_{1q}$ is closed, each component of P belongs to the same set $K_{1q}$ as the members of the sequence converging to it, and $p \in C$.

An entirely analogous argument shows that $K_n^0 - C$ is closed, which implies C is open as well as closed in $K_n^0$. Thus each component of $K_n^0$ is contained in one of the sets C, which implies that the number of components is $\geq$ (number of sets C) $= \sum_{i=1}^{m} {}_mC_i \cdot {}_nS_i$.
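The count in Theorem 6.7 can be evaluated numerically. The sketch below is illustrative only: the function names are not from the paper, and it assumes that ${}_nS_i$ means the number of ordered i-tuples of positive integers with sum n, which equals the binomial coefficient $\binom{n-1}{i-1}$. Under that reading the sum collapses, by Vandermonde's identity, to $\binom{m+n-1}{n}$, the number of ways of distributing n indistinguishable components among m components of $K_1^0$.

```python
from itertools import product
from math import comb


def compositions_count(n, i):
    """Assumed meaning of nS_i: ordered i-tuples of positive integers summing to n."""
    if i < 1 or i > n:
        return 0
    return comb(n - 1, i - 1)


def compositions_brute(n, i):
    """Brute-force check of compositions_count by direct enumeration."""
    return sum(1 for parts in product(range(1, n + 1), repeat=i) if sum(parts) == n)


def component_lower_bound(m, n):
    """Sum over i = 1..m of mC_i * nS_i, the lower bound stated in Theorem 6.7."""
    return sum(comb(m, i) * compositions_count(n, i) for i in range(1, m + 1))
```

For n = 1 the bound reduces to m, the number of components of $K_1^0$ itself, which is a quick sanity check on the reading of ${}_nS_i$ adopted here.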

Theorem 6.7 leads us to restrict our connectivity considerations to the set $K_1^0$.

DEFINITION 6.3. We shall say that a subset P of M can be $\leq r$-regularly deformed into a set Q provided there exists a family of set functions $[f_t(P)]$ $(0 \leq t \leq 1)$ such that $f_0(P) = P$, $f_1(P) = Q$, $f_t(P) \subset M$ for all t, and if $[t_i] \to t$ then $[f_{t_i}(P)] \Rightarrow f_t(P)$.


THEOREM 6.8. A necessary and sufficient condition that the closed $lc^r$ subset P of M be $\leq r$-regularly deformable into Q is that p and q can be joined by an arc in K.


Proof. Suppose that P can be regularly deformed into Q by means of the set functions $[f_t(P)]$. Then $\sum_{t=0}^{1} f_t(p) \subset K$, where $f_t(p)$ is the point of K corresponding to $f_t(P)$, is a continuous image of the interval $(0 \leq t \leq 1)$, and hence a locally connected continuum. Thus it contains an arc from p to q.

Conversely, suppose there exists an arc pq which we shall think of as the homeomorphic image of the interval $(0 \leq t \leq 1)$ where 0 corresponds to p and 1 to q. Define $f_t(P)$ to be the set corresponding to the point of the arc associated with t. Clearly this set of functions defines the regular deformability.


COROLLARY 6.81. A necessary and sufficient condition that each $lc^r$ subset of M with n components be $\leq r$-regularly deformable into any other set of the same type is that $K_n^0$ be arcwise connected.


THEOREM 6.9. If M is a continuum containing no simple closed curve then $K_1^0$ is connected.


Proof. Every $lc^r$ subcontinuum is of course locally connected and since it contains no simple closed curve by hypothesis, it must be a dendrite. Now any dendrite D of M can clearly be deformed to any point p of it without going outside of itself, if we remove the regular convergence restriction. That is, there exists a family of set functions $[f_t(D)]$ such that $f_t(D) \subset D$ $(0 \leq t \leq 1)$, $f_0(D) = D$, $f_1(D) = p$, and $[t_i] \to t$ implies $[f_{t_i}(D)] \to f_t(D)$. But $f_t(D)$ must also be a dendrite for each t and all of the convergence takes place in a dendrite, hence it is easy to see that the convergence is automatically regular. Now by Theorem 6.8 there exists an arc dp in K and hence in $K_1^0$. But the set G of $K_1^0$ corresponding to the points of M is clearly connected since M is, hence $K_1^0 = G + \sum$ arcs dp is connected.


THEOREM 6.10. If $r > 0$ and $K_1^0$ is connected, then M is a continuum containing no simple closed curve.

Proof. By Theorem 6.6 M is a continuum. Also by Theorem 6.5 $K_1^0$ and $K_0^1$ are both open and closed subsets of K. Thus $K_1^0 \cdot K_0^1$ is both open and closed. But $K_1^0$ is connected, hence $K_1^0 = K_1^0 \cdot K_0^1$ or $K_1^0 \cdot K_0^1 = 0$. Now if $K_1^0 = K_1^0 \cdot K_0^1$ then $K_1^0 \subset K_0^1$, which implies that M must not contain a simple closed curve, for a simple closed curve would correspond to a point in $K_1^0$ but not in $K_0^1$. Now $K_1^0 \cdot K_0^1 = 0$ cannot occur, as a single point corresponds to a point in both $K_1^0$ and $K_0^1$; thus the theorem is true.




The following corollary follows immediately from Theorems 6.9 and 6.10.

COROLLARY 6.10.1. If $r > 0$ and M is locally connected, then a necessary and sufficient condition that M be a dendrite is that $K_1^0$ be connected.


7. Compactness and local compactness of K. 


THEOREM 7.1. The properties that K be compact, M be finite, and H=K 
are equivalent. 


Proof. Suppose that K is compact but that M is infinite. Then there exists a sequence $[p_i]$ of distinct points of M converging to p such that $p_i \neq p$ for any i. Let $Q_i = p_i + p$ for each i. Then $[q_i] \subset K$ is an infinite subset of K with no limit point, for although the sets $Q_i$ converge they do not converge 0-regularly nor does any subsequence. Hence M is finite.

Next suppose M is finite; then clearly every subset of M is finite, and hence $lc^r$. That is, $H = K$.

Finally if $H = K$ but K is not compact, then K must be infinite. This of course implies M is infinite, and we can again find an infinite sequence $[p_i]$ of distinct points of M converging to a point p. But the closed set $[(p_i) + p]$ is not locally $\gamma^0$-connected, and thus corresponds to a point of H which is not in K. This contradicts our hypothesis and implies that K is compact.


THEOREM 7.2. If $n > 1$, $K_{n,0,0,\ldots,0}$ will be compact if and only if M is finite.


Proof. If $K_{n,0,\ldots,0}$ is compact but M is not finite, then there exists a convergent sequence $[p_i]$ of distinct points of M converging to the point $p_0$. Now consider the sets $A_i = p_{i+1} + p_{i+2} + \cdots + p_{i+n}$ for $i = 0, 1, \ldots$. Clearly $a_i \in K_{n,0,\ldots,0}$ for all i but the infinite set $\sum_{i=0}^{\infty} a_i$ contains no limit point. This contradicts compactness and M must be finite.
Conversely, if M is finite so is K and the subset $K_{n,0,\ldots,0}$, which is therefore compact.


COROLLARY 7.21. If $n > 1$, a necessary and sufficient condition that $K_n^0$ be compact is that M be finite.

Proof. If $K_n^0$ is compact so is its closed subset $K_{n,0,\ldots,0}$ and by Theorem 7.2 M is finite. The converse is obvious.


THEOREM 7.3. A necessary and sufficient condition that $K_1^0$ be compact is that M contain no non-0-regular convergent sequence of arcs (i.e. every convergent sequence of arcs in M converges 0-regularly).




Proof. If $K_1^0$ is compact, then every convergent sequence of locally $\gamma^s$-connected $(s \leq r)$ continua in M must converge regularly. For if not we could get by Theorem 1.4 a sequence no subsequence of which converges regularly, and the points of K corresponding to this subsequence would have no limit points. In particular, every convergent sequence of arcs converges regularly and hence 0-regularly.

Conversely, if M has the desired property, then it must contain no simple closed curve. For clearly there exists a non-0-regular convergent sequence of arcs on any simple closed curve. Thus every lc continuum of M is a dendrite and contains no complete cycles of dimension $\geq 1$. Now if $K_1^0$ were not compact there would be a sequence $[p_i]$ of distinct points in $K_1^0$ which contains no convergent subsequence. We can suppose that $[P_i]$ converges in the ordinary sense; and since by the above each $P_i$ is a dendrite, this convergence is s-regular for $1 \leq s \leq r$. Hence if $[P_i]$ does not converge regularly, it does not converge 0-regularly. This implies that for some $\epsilon > 0$ there exists a sequence of point pairs in the successive sets $P_i$ which are not contained in a continuum of $P_i$ of diameter $< \epsilon$.¹⁴ In particular the arc joining this point pair has diameter $\geq \epsilon$. Clearly this sequence of arcs does not converge 0-regularly, contradicting our hypothesis, and we conclude that $K_1^0$ is compact.


COROLLARY 7.31. A necessary and sufficient condition that $K_1^0$ be a continuum is that M be a continuum containing no non-0-regular convergent sequence of arcs.

Proof. If $K_1^0$ is a continuum then M is a continuum by Theorem 6.6 and has the desired property by Theorem 7.3. Conversely, if M has the above property then it certainly contains no simple closed curve and by Theorem 6.9 $K_1^0$ is connected. Finally by Theorem 7.3 $K_1^0$ is compact, and hence a continuum.


COROLLARY 7.32. If M is locally connected, then a necessary and sufficient condition that $K_1^0$ be compact is that every component of M shall be a dendrite.

Proof. If every component of M is a dendrite, then M contains no non-0-regular convergent sequence of arcs, and by Theorem 7.3 $K_1^0$ is compact. Conversely, if $K_1^0$ is compact, then by Theorem 7.3 M contains no non-0-regular convergent sequence of arcs, hence contains no simple closed curve, and hence each component is a dendrite.


14 It is easy to see that if in the definition of 0-regular convergence, we think of a 0-cycle as a pair of points, then $\sim 0$ means that the pair lie in a continuum.




The next corollary follows in the same way from Corollary 7.31.

COROLLARY 7.33. If M is locally connected, then a necessary and sufficient condition that $K_1^0$ be a continuum is that M be a dendrite.

We now establish a few properties of continua with the property that they contain no non-0-regular convergent sequence of arcs.


THEOREM 7.4. If M is a continuum containing no non-0-regular convergent sequence of arcs then

a) M contains no simple closed curve.

b) If M is arc-wise connected it is a dendrite.

c) For each point p the set $D_p$ consisting of all points q of M such that there exists an arc pq in M is a dendrite ($D_p$ = the maximal locally connected subset of M containing p).

d) The sets $[D_p]$ form an upper semi-continuous decomposition of M.¹⁵


Proof. a) has been mentioned several times before. b) If M is arc-wise connected, then M is uniformly locally connected. For if not there exists a number $\epsilon$, a sequence $[\delta_i] \to 0$, and a sequence of point pairs $(p_i, q_i)$ such that $\rho(p_i, q_i) < \delta_i$ but $p_i + q_i$ is not contained in a continuum of diameter less than $\epsilon$. In particular, there is an arc $p_i q_i$ with diameter $\geq \epsilon$. Clearly no subsequence of these arcs converges 0-regularly, contrary to hypothesis. Now if M is locally connected, then by Theorem 7.3 and Corollary 7.33 we conclude that M is a dendrite. c) $D_p$ is clearly connected. Also $D_p$ is closed; for consider a sequence $[q_i]$ of points of $D_p$ converging to $q_0$. By definition there exist arcs $q_i p$ and we can suppose we have chosen a convergent subsequence. Now by hypothesis $[q_i p]$ converges 0-regularly and hence the limit set is an arc¹⁶ containing $q_0$ and p. That is, $q_0 \in D_p$, which is thus closed. Now $D_p$ is a continuum with the property that every convergent sequence of arcs converges 0-regularly. Since $D_p$ is clearly arcwise connected, we conclude from b) that $D_p$ is a dendrite. d) It is clear that the sets $D_p$ are disjoint; that is, if $q \in D_p$, then $D_q = D_p$. Also the sets $[D_p]$ cover M and are compact by c). Finally the collection is upper semi-continuous, for consider $[D_{p_i}]$ such that lim. inf. $[D_{p_i}] \cdot D_{p_0} \neq 0$. Let q be contained in lim. sup. $[D_{p_i}]$; then there exists a subsequence $[D_{p_{n_i}}]$ converging to a point set D containing q. Now the convergence must be 0-regular, otherwise we could find a non-0-regular convergent

15 See R. L. Moore, Transactions of the American Mathematical Society, vol. 27 (1925), p. 416.

16 See G. T. Whyburn, "On sequences and limiting sets," Fundamenta Mathematicae, vol. 25 (1935), p. 416.


sequence of arcs. Hence D is a dendrite by Corollary 6.32. But D contains $p_0$ and q; hence $q \in D_{p_0}$, which is sufficient for upper semi-continuity.


THEOREM 7.5. A necessary and sufficient condition that a compact set M contain no non-0-regular convergent sequence of arcs is that corresponding to every $\epsilon > 0$ there is a $\delta > 0$ such that if p and q are two points of M with $\rho(p,q) < \delta$, then every arc joining p and q has diameter $< \epsilon$.


Proof. Suppose that M contains no non-0-regular convergent sequence of arcs, but that corresponding to some $\epsilon > 0$ there is a sequence $[\delta_i] \to 0$ and a sequence of point pairs $[p_i, q_i]$ such that $\rho(p_i, q_i) < \delta_i$ but there is an arc $p_i q_i$ of diameter $\geq \epsilon$ for each i. Clearly this sequence of arcs does not contain a 0-regular convergent subsequence, contrary to our hypotheses.

Conversely, suppose that corresponding to each $\epsilon > 0$ there is a $\delta > 0$ with the properties of the theorem. Now consider any convergent sequence of arcs $[a_i b_i]$. If the convergence were not regular, there would be an $\epsilon > 0$ and a sequence of point pairs $p_i + q_i \subset a_i b_i$ such that $\rho(p_i, q_i) < \delta_i$ where $[\delta_i] \to 0$, but such that the subarc $p_i q_i$ has diameter $\geq \epsilon$. Clearly corresponding to this $\epsilon > 0$ there is no $\delta > 0$ with the properties of the hypothesis; thus the sequence $[a_i b_i]$ must converge 0-regularly.


THEOREM 7.6. If $K_1^0$ is locally compact, then corresponding to each point P of M there is a neighborhood of P such that every locally connected subcontinuum is a dendrite.


Proof. Suppose $K_1^0$ is locally compact, but that in each of the neighborhoods $U_i$ closing down on P there is a locally connected subcontinuum which is not a dendrite. Thus there is a simple closed curve $C_i$ in each $U_i$ and $\delta(C_i) \to 0$. This of course implies that $[C_i] \Rightarrow P$. Now corresponding to any neighborhood $U_p$ of p there exists a number n such that $K_0^1 \cdot K_1^0 \cdot h_K(C_n) \subset U_p$. For if this were not so there would be an arc $P_i$ of $C_i$ such that $p_i \in C(U_p)$. Again $\delta(P_i) \to 0$; and hence $[P_i] \Rightarrow P$; that is, $[p_i] \to p$. But this is impossible since $C(U_p)$ is closed and $p \in U_p$. Now consider the set $K_0^1 \cdot K_1^0 \cdot h_K(C_n) \subset U_p$. This set consists of all subarcs of the simple closed curve $C_n$ and hence contains a convergent sequence of arcs no subsequence of which converges 0-regularly. This subsequence generates a subset of $U_p$ which has no limit point, hence $U_p$ is not compact. But $U_p$ was any neighborhood of p, hence the local compactness at p is violated.

The following two corollaries follow immediately. 


COROLLARY 7.61. If K is locally compact, then the conclusion of Theorem 7.6 holds.




COROLLARY 7.62. If M is locally connected and $K_1^0$ (or K) is locally compact, then M is locally a dendrite.


THEOREM 7.7. If M is locally a dendrite, then $K_1^0$ is locally compact.


Proof. The hypothesis that M be locally a dendrite implies that every component of M contain only a finite number of true cyclic elements each one of which is a linear graph. Thus M contains only a finite number of simple closed curves. Now consider any p in $K_1^0$; hence P is a locally connected continuum. We can find an $\epsilon$-neighborhood of P whose closure contains those simple closed curves of M and only those contained in P. Denote by L the subset of $K_1^0$ corresponding to the collection of locally $\gamma^s$ $(s \leq r)$ connected subcontinua of $U_\epsilon(P)$ which contain all the true cyclic elements of P. L is open, for if not there would be a sequence $[P_i] \Rightarrow P$, where we can suppose all the $P_i$ are in $U_\epsilon(P)$, but each $P_i$ does not contain at least one point of some simple closed curve of P. But G. T. Whyburn has shown¹⁷ that every simple closed curve of P would have to be the 0-regular limit of a sequence of simple closed curves in the $[P_i]$. Thus since the total number of simple closed curves is finite, almost all $[P_i]$ would have to contain the simple closed curves of P, contrary to hypothesis.

Finally $\bar{L}$ is compact. To see this consider the following decomposition G of M. The set g will be an element of G if it is a true cyclic element of a component of M or a point not contained in one. This decomposition is upper semi-continuous since the number of true cyclic elements is finite. Let M' be the hyperspace associated with the decomposition. Clearly each component of M' is a dendrite. Now consider any infinite sequence of points $[p_i]$ in L; that is, a sequence $[P_i]$ each one of which contains the same simple closed curves as P. Suppose $[P_{n_i}]$ is a convergent subsequence with limit P, and $[P'_{n_i}]$ are the corresponding sets in M'; then $[P'_{n_i}]$ converges regularly since all sets are subsets of a dendrite. This implies the regular convergence of $[P_{n_i}]$ since the elements of the decomposition of M which are not points are all fixed in the convergence, and hence $[p_{n_i}] \to p$.

This shows the local compactness of $K_1^0$ at p, for if $V_p$ is any neighborhood of p, then there is a neighborhood $W_p$ such that $\bar{W}_p \subset V_p$. Also $W_p \cdot L$ is open and $W_p \cdot L \subset \bar{W}_p \cdot \bar{L}$ is a compact subset of $V_p$.


THEOREM 7.8. $K_{n,0,\ldots,0}$ $(n > 1)$ will be complete if and only if M is finite.


17 See reference in footnote 16, p. 413.




Proof. The proof follows the same pattern as Theorem 7.2 except that 
here we must show that [A;] is a Cauchy sequence, Aj = @i + Gis + °° * + Gian. 
This follows immediately from the fact that the $f_{ij}(A_k) = 0$ for as many of the i and j as we wish and for almost all k, where the $f_{ij}$ are the functions used in defining the metric.

The following two corollaries are immediate. 


COROLLARY 7.81. If $n > 1$, a necessary and sufficient condition that $K_n^0$ be complete is that M be finite.


CoroLLary 7.82. A necessary and sufficient condition that K,° be com- 
plete is that M be finite. 


THEOREM 7.9. If M is locally connected, a necessary and sufficient condition that $K_1^0$ be complete is that every component of M be a dendrite.

Proof. The proof follows the same pattern as Theorem 7.8 and follows from the fact that a sequence of subarcs of a simple closed curve which converges to the simple closed curve is a Cauchy sequence.

Thus the hypothesis of completeness imposes about the same restrictions on K as does compactness.


UNIVERSITY OF VIRGINIA 
AND 
LOUISIANA STATE UNIVERSITY. 


UNIVERSAL FUNCTIONS OF POLYGONAL NUMBERS, II.* 


By L. W. GRIFFITHS.


1. Introduction. The universal functions of polygonal numbers of order m + 2, with the sum of the coefficients in each function equal to m + 3, are determined in this paper. If the sum of the coefficients is at most m + 2 the universal functions have been determined.† A necessary and sufficient condition that the universality of a function of weight m + 3 be not implied by that of a function of weight at most m + 2 is proved. For each $m \geq 3$ a universal function of weight m + 3 satisfying this condition is exhibited.

The general development of this paper is suggested by the paper to which reference has been made. Certain notations and facts established in that paper are used. Many proofs are so similar that they are not presented here.

The polygonal numbers of order m + 2 are defined, for m a positive integer, by $p(x) = x + m(x^2 - x)/2$ with $x = 0, 1, 2, \ldots$. Here $m \geq 3$, for the reasons stated in I. The coefficients $a_1, \ldots, a_n$ in the functions $f = a_1 p(x_1) + \cdots + a_n p(x_n)$ are positive integers to be determined. Also $1 \leq a_1 \leq \cdots \leq a_n$ and, by definition, $w_k = a_1 + \cdots + a_k$ $(1 \leq k \leq n)$ and $w = w_n$. It will be proved that, if $w = m + 3 \geq 6$, then f is universal only if f satisfies (7), (8), (9), or (10). It will also be proved that f is universal if f satisfies (7), (8), or (10). If f satisfies (9) then f represents every positive integer A except perhaps when $140m + 62 < A < M$, where M depends only on m and f. M is evaluated in Theorem 4. The methods of this paper are applicable to the verification that f actually represents these integers A. Verification is practically certain, but the extremely arduous verification was not carried beyond 140m + 62.
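The notion of a function f representing an integer can be made concrete with a short computation. The following is a minimal sketch, not the author's method: it assumes the standard formula $p(x) = x + m(x^2 - x)/2$ with x ranging over the non-negative integers, and the names `polygonal` and `represented` are illustrative only.

```python
def polygonal(m, x):
    """x-th polygonal number of order m + 2, assuming p(x) = x + m(x^2 - x)/2."""
    # x*x - x is always even, so the integer division is exact.
    return x + m * (x * x - x) // 2


def represented(coeffs, m, bound):
    """Integers in [0, bound] of the form sum(a_k * p(x_k)) with every x_k >= 0."""
    values = []
    x = 0
    while polygonal(m, x) <= bound:
        values.append(polygonal(m, x))
        x += 1
    reachable = {0}
    for a in coeffs:
        # Add one term a * p(x) at a time, discarding sums above the bound.
        reachable = {s + a * v for s in reachable for v in values if s + a * v <= bound}
    return reachable
```

For instance, with m = 3 (pentagonal numbers 0, 1, 5, 12, 22, ...) the function (1, 1, 1, 3) of weight 6 fails to represent 19 = 4m + 7, while (1, 1, 2, 2) represents every integer from 0 through 21; a larger `bound` extends the same check to any finite range.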


2. Necessary and sufficient conditions that f represent integers less than 8m + 9. The following lemmas and theorems have been proved by methods similar to those used in the proofs of the corresponding facts in I.


LEMMA 1. Let $w = m + 3$. Then $f = 0, \ldots, m + 3$ if and only if $f = (1, a_2, \ldots, a_n)$ and $a_k \leq w_{k-1} + 1$ $(2 \leq k \leq n)$.

* Received October 30, 1942. 

† L. W. Griffiths, Annals of Mathematics, vol. 31 (1930), pp. 1-12. This paper will be cited as I. A misprint is corrected here.




LemMA 2. Let w=m+3. Then if and only 
of f = (1,1,a3,- > and Sw, (83SkZN). 
LemMMA 3. If m= 4 and w =m 3, then f 


if and only tf f satisfies (1) or (2): 
(1) - -,an), Ay S Wey (BSkSn), 
(2) fm (1,- 


Theorem 1 follows from Lemmas 1, 2, and 3, if m2 4; if m=3 it 
follows from Lemmas 1 and 2, and direct verification for the integers 


2(m + 3. 


THEOREM 1. Let w=m+3=—6. Then if and 
only if f satisfies (3) or (4): 


(3) f= (1,1,1,3) 


(4) f=(1,1,43,- ds = 1, 2, (45k5 


If f satisfies (4) with a; = 1, then f = 3m + 3,:--,5m+ 6. If f satis- 
fies (3) then f~4m-+ 7%. If f satisfies (4) with a; = 2, and if there exists a 
coefficient a; such that = — 1 > 3, then f ~4m + 6+ 
But if no such coefficient a, exists then f=3m-+ 3,---,5m-+ 6. These 


facts prove Theorem 2. 


THEOREM 2. Lel w= m-+3= 6, then f =0,- -,5m-+ 6 af and only 


if f satisfies (5) or (6): 


(5) 
(6) f= * == 3, n=4 or ay = —— 2 


(5SkSn). 


If f satisfies (6) with a, = 3 and n > 4, or if f satisfies (5) with a; = 1 
and a, = 3 and n > 5, then f5m-+ 9. Otherwise the functions (5) and 
(6) represent the integers 5m-+-7,---,8m-+ 8, except that (1,1, 2,3) 
~%m+10. These facts prove Theorem 3. 


TueoreM 3. Let w=m+3=26. Then f=0,:--,8m-+8 if and 
only if f satisfies (7), (8), (9), or (10): 


a4; 3 if n>5, 




(8) f= 
(9) f = (1,1, Ge SWer—2 (5SkSn), 


(10) f= (1,1,2,2). 


3. Universality of the functions in Theorem 3. To prove that a function f which satisfies (7) with $a_5 = 1$ represents every positive integer it is sufficient, by Theorem 3, to establish Theorem 4 and to verify that, if A is an integer such that $8m + 8 < A < 44m + 40$, then A is represented by f. Further details in the proofs of Theorems 4 and 5 are not presented here, except to remark that the proof of Theorem 4, if f satisfies (9) or (10), uses auxiliary lemmas proved by Dickson² and by myself.³


THEOREM 4. Let f satisfy Theorem 3. Then there is an integer M, depending only on m and f, such that f represents every integer $\geq M$. If f satisfies (7) with $a_5 = 1$, then $M = 44m + 40$; if f satisfies (7) with $a_5 = 2$ or 3, then $M = 142m + 208$; if f satisfies (8), then $M = 387m + 108$; if f satisfies (9) or (10), then $M = (11d^2 - 55d + 74)m + (22d - 70)$, where d = 6, 8, 10, 12, 14, 16 according as f is (1, 1, 2, 2), (1, 1, 2, 2, 2), (1, 1, 2, 2, 3, ...) or (1, 1, 2, 2, 2, 3, ...), (1, 1, 2, 2, 4, ...) or (1, 1, 2, 2, 2, 4, ...), (1, 1, 2, 2, 2, 5, ...), (1, 1, 2, 2, 2, 6, ...).
THEOREM 5. If f satisfies (7), (8), or (10), then f is universal. If f 
satisfies (9), then f represents every positive integer A except perhaps when 


140m + 62<A<M. 
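The size of the range left open in Theorem 5 follows directly from the formula for M in Theorem 4. The sketch below simply evaluates that formula; the function names are illustrative and not from the paper, and m is taken to be w − 3 as in the standing hypothesis w = m + 3.

```python
def threshold_M(d, m):
    """M = (11 d^2 - 55 d + 74) m + (22 d - 70), the bound of Theorem 4 for cases (9) and (10)."""
    return (11 * d * d - 55 * d + 74) * m + (22 * d - 70)


def unverified_range(d, m):
    """Integers A with 140m + 62 < A < M, the range not covered by Theorem 5."""
    return range(140 * m + 63, threshold_M(d, m))
```

With d = 6 the formula gives M = 140m + 62 exactly, so the open range is empty for (1, 1, 2, 2), in agreement with the statement that a function satisfying (10) is universal. With d = 8 and m = 5, the case (1, 1, 2, 2, 2), the formula gives M = 338m + 106 = 1796.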


4. Universality not implied by that of a function of weight m + 2. Theorem 5 and the universal functions of weight at most m + 2, found in I, show that a function of weight m + 3 has its universality implied by that of a function of lower weight if, and only if, the coefficients of the function of weight m + 3 are obtained from the coefficients of a universal function of weight m + 2 by adjoining a coefficient 1. No function (8), (9), or (10) can be so obtained. The list of universal functions when $w = m + 2$ omitted the function (1, 1, 1, 1, 3). Hence a function (7) can be so obtained if and only if either it is (1, 1, 1, 1, 1, 3), or it is one of the functions (1, 1, 1, 1, 2, ...) with $n \geq 5$ which does not have a coefficient $a_k$ such that $a_k = w_{k-1} - 1$, or


2 L. E. Dickson, American Journal of Mathematics, vol. 56 (1934), pp. 513-528.
3 L. W. Griffiths, American Journal of Mathematics, vol. 55 (1933), pp. 102-110, and vol. 58 (1936), pp. 769-782.




it is one of the functions (1, 1, 1, 1, 1, 1, ...) with $n \geq 6$ which does not have such a coefficient $a_k$, or it is one of the functions (1, 1, 1, 1, 1, 2, ...) with $n \geq 6$ which does not have such a coefficient $a_k$.


THEOREM 6. The universality of a function of weight m + 3 is not implied by that of a function of weight m + 2 if and only if it satisfies (8), or (9), or (10), or if it is one of the functions (7) which is not in the preceding list.


If m = 10 the function f defined as follows satisfies Theorem 6. Define 


q and r by m − 2 = 4q + r, with 0 ≤ r ≤ 3, and let a4, =: =—1,
dg = 4, If m=3,---,9 the following func- 


tions, respectively, satisfy Theorem 6,‘ since they satisfy (8): (1,1, 2, 2), 


NORTHWESTERN UNIVERSITY. 




THE PROJECTIVE THEORY OF SURFACES IN RULED SPACE, II.* 


By CHENKUO Pa. 


PART C. 


Asymptotic Ruled Surfaces $R_1$ and $R_2$.


1. General theory. Consider on a surface S a curve $C_t$ through a point
P; the direction of the tangent PT is determined by the value du/dv.¹ The
tangents of the asymptotic curves v = const. (u = const.) drawn from the
points of $C_t$ constitute a ruled surface known as the asymptotic ruled surface
$R_1$ ($R_2$) of S along $C_t$. In order to discuss the behavior of these ruled surfaces
in the neighborhood of the given point P, it is convenient for us first to
determine in our coördinate system the fundamental quantities at P. For
the subsequent discussion we notice here two sets of formulas.

It is clear that the non-homogeneous coördinates of an ordinary point $(\bar{x})$
of $R_1$ are of the form (6), so that the differential equation of the curved
asymptotic lines is given by the expression (7). Our purpose is to normalize
the coördinates of (7) so that the condition (3) should be satisfied at P.
To this end, we introduce at the given point a transformation defined by


(67) v= (Bt )oz, = y — (Bt) 


It is readily seen that the fundamental differential equations (1) remain 


unaltered and at P 
(68) =], Yu == = = 0, Dw = Bt’, Y uv = — pt. 


Since after this substitution the point (@) may be written in the same form as 
(6), namely, 
x —= ALy c= 2’, 


* Received June 1, 1942. Parts A and B of this paper appeared in this Journal,
vol. 65 (1943), pp. 712-736.

¹ We denote, for brevity, du/dv by t, d²u/dv² by t', the osculating plane to $C_t$ at P
by $\pi_t$ and the tangent plane to S at P by $\pi$, etc.

the differential equation (7) of the curved asymptotic lines of $R_1$ also remains
unaltered. Since the fundamental differential equation of $R_1$ must be completely
integrable we can introduce a new asymptotic parameter w in place
of λ such that at P

(69) (0/dw)o = 1. (0°A/dw?) o = Oy.” 


The advantage of choosing w in this way is that the conditions (3) still hold
for $R_1$ at P, so that


(70) = Yr = = 1, = Zu = = Yur = QO. 


For the sake of convenience (x') is replaced by (x) and therefore (68)
by the following
Ly = Yu = = 1, Yu == Ly 0, Luv = Yur, = — Bt. 


After this reduction the method which we have so far used for the computation
of the fundamental quantities of Q," is applicable, without any alteration,
to the surface $R_1$. Let the fundamental differential equations of $R_1$ be


+ Bite, Tuv = + 
then we can easily show that at P 
— (0°A/dw?) /(0A/Ow) = Oy, Bi 
Putting, for brevity, 
N = + By + 2Bvt + (Bu-— BOu)t? + Bl’ + Bt(O.— Bt’), 
we have the differential equation (7) in the form 
Ay = — A(Out + Bt?) — Et. 


Now, denote by $X_\tau$ the derived function of a function X (X = x, y, z, β, γ,
$\theta_u$, $\theta_v$, etc.) along a curved asymptotic line of $R_1$; by virtue of the equation (7)
we have

ley -+ MLy + ALuv, 


where the quantities l, m are given by
1 = + ¢+ A, = m—1-+ 


In order to calculate the fundamental quantities of $R_1$, it is useful to
establish here some relations at P:




² For the following discussion it is unnecessary to choose the values of $\partial^m\lambda/\partial w^m$
(for m ≥ 3) at P.




Awwr = N — 0,7t — BOutl?, 

= — tN — (Ouut? + Burt + Out’ + 2Btt’ + But? + Bet?) + (Oy + Bt)*t2, 
m=1, my = Bt, mow =— Bl? (Bu BOu)t? + Brt + BY, 
Muy = BOyt® + — 3Btt’ — 2Byt® — 2B, t2, 

low = BOut® + — tN — 2Btt’ — B,t* — B,t2, 

loy = t2N — AvvBt? + 2But* + + 


By means of the identity 6x. = (0°A/dw*) /(@A/dw) we obtain at P 
6, we = Awwr + 6y7t BO,t?. 


Substituting the value of Awwy at P in the right-hand side of this equation we 
have at P 
Ni 


Tov = (y + Bl?) tu + (Ov — Bt?) av. 
On the other hand, the fundamental differential equations give 
Loy + yitw- 


Taking account of the values of $(x_\tau)$, $(x_w)$ and $(x_u)$, $(x_v)$ given by (70) and
(68') we have the two following relations:


Av = 6, — 
After a simple calculation we find the values of (Zrry) at P as follows: 


Divi + 21 Aut Yer — t? (Our By) 
A (ly + + Aw — Out?) + (*) 


and on the other hand by using the differential equation we have at P


= (yie + (*) Ly + 


Use of (70) and (68’) gives one of the required relations, namely, at P 


yiv = yo + 3Bul* + + 


Moreover, the value of (®rrw) at P may be represented as follows 


1, 
Differentiation of iclds the 
relations 
7) 




Fovw = [low + lw + Ou + muy + dwt (Our By) + Av (yu + yOu) 
(Li Mat Acw a. + Luv Vy. 


If we differentiate the fundamental differential equations for $(\bar{x})$ we have


= + w) Fw + Ly + Low. 
Therefore 


yu + Byt + — 3B,t? — 2Bul® 3Btt’. 


Yiw 
Thus we have obtained the fundamental quantities of the surface $R_1$ at the
given point P up to and including the neighborhood of order 4 of the surface.


In summary, we have the following formulas:³


Ou—Ou, O10 —= Oy — Bl’, 
yiu = yu + Byt + — 3Bol? — 2But® — 

= yo + BBul* + 4B + 

= Our + By + 2Bvt (Bu — BOu)t? + Bt’ + — . 


It remains to determine the relations between the second order of a curve
$C_t$ when it is referred to the coördinate systems defined by S and by $R_1$,
respectively, at P.⁴ According as the curve $C_t$ is referred to the coördinate
system of S or $R_1$ the equation of the osculating plane $\pi_t$ of $C_t$ is given by


— ( Bt Y t0, — t76, —t’)z 0 


or 

— 2¢,2; + — Outs? — U1) 2, = 0, 
where 
(72) Bz, y=y— 


and $t_1$, $t'_1$ are the corresponding values of t and t' of $C_t$ in the new system of
reference. From the relation (72) it is evident that $t_1 = t$, so that the condition
for the equivalence of these two equations is


Thus we have 


³ In the following we write, for the sake of convenience, u in place of w.

⁴ We define a coördinate system of $R_1$ at P in a similar manner as that of S.

and conclude that the equation of any geometrical element of $R_1$ defined by
the neighborhood of fourth order at P may be found by applying the transformation
(71), (72) and (73) to the corresponding equation of the same
element defined by S in our system of reference. As an illustration, taking
the equation of the osculating quadric of $R_1$ at P


461uv21? = 0, 


and making use of the method just mentioned, we find the equation of one of
the asymptotic osculating quadrics of S associated with $C_t$ as was shown in
Part A.

The corresponding values of the fundamental quantities $\beta_2$, $\gamma_2$, $\theta_{2v}$,
etc., of $R_2$ may be obtained by interchanging in the equation (71) the quantities
u and v; β and γ; t and $t^{-1}$; t' and $-t^{-3}t'$, etc. We do not write them here
in full. For the following discussion we put


74 
and 
(75) = yu/Y + bu; pe 26, Bu/B; 


(v1 = + Ov, = —- 
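The interchange rule quoted before (74), namely t into 1/t and t' into −t'/t³, is the ordinary rule for exchanging the roles of u and v along a curve: if t = du/dv and t' = d²u/dv², then dv/du = 1/t and d²v/du² = −t'/t³. The sketch below checks this numerically; the sample curve u = exp(v) and the helper name `swapped_derivatives` are our own illustrative choices, not taken from the paper.

```python
import math


def swapped_derivatives(t, t_prime):
    """Given t = du/dv and t' = d2u/dv2, return (dv/du, d2v/du2)."""
    return 1.0 / t, -t_prime / t**3


# Sample curve (an assumption for illustration): u = exp(v), hence v = log(u).
v = 0.7
u = math.exp(v)
t, t_prime = u, u  # du/dv = d2u/dv2 = exp(v)

dv_du, d2v_du2 = swapped_derivatives(t, t_prime)

# Compare with the derivatives of v = log(u) computed directly.
assert math.isclose(dv_du, 1.0 / u)
assert math.isclose(d2v_du2, -1.0 / u**2)
```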


2. A pencil of lines associated with a curve on a surface. In the
present section we define a pencil of invariant lines which characterize the
second order of a curve $C_t$ of S. It is well known that by means of the second
order of a space curve we cannot define any element other than the osculating
plane. Here we shall define other elements by making use of the relations
between the curve and the surface.

The expansions of the curved asymptotic line $C_1$ of $R_1$ at P are known to be


— + Ciys* + (5), 
= — + — + (6). 
With the aid of the transformation (72) the above equations can be written
in the form


== 


(76) . (Qi + Bi )y® + (C; y* + (5), 
lz= — YoyQiy* + + %Ci— Yori) + (6), 


where $Q_1$, $\phi_1$ are defined in (74) and (75) respectively and
C; Vs (— + + + Yiu) + ir}. 


Similar expansions may be obtained for the curved asymptotic line $C_2$ of $R_2$
by interchanging in (76) the quantities x and y; u and v; β and γ; $Q_1$ and $Q_2$
and the indices 1 and 2, etc.

From (71) it follows immediately that when $C_t$ touches a Darboux
tangent at P then the asymptotic curve $C_1$ has a point of inflection at P and
consequently that any Darboux curve is a flecnode curve of its asymptotic ruled
surfaces $R_1$ and $R_2$, as has been pointed out by Palozzi.⁵ In fact, both the
asymptotic ruled surfaces $R_1$, $R_2$ associated with $C_t$ and the given surface S
have at P equal projective linear elements in the direction t, since


$$\tfrac12\gamma_1\,(dv^2/du) = \tfrac12\bigl(\gamma + \beta(du/dv)^3\bigr)(dv^2/du) = (\gamma\,dv^3 + \beta\,du^3)/2\,du\,dv = \tfrac12\beta_2\,(du^2/dv).$$
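The equality of the three linear elements can be checked with exact rational arithmetic. The sketch below assumes our reading of the display, namely γ₁ = γ + βt³ on R₁ and, by the interchange of β with γ and of t with 1/t, β₂ = β + γt⁻³ on R₂, where t = du/dv; the function name and the sample values are hypothetical.

```python
from fractions import Fraction


def linear_element_forms(beta, gamma, du, dv):
    """Return the three expressions that the text asserts to be equal."""
    t = du / dv
    gamma1 = gamma + beta * t**3    # assumed value of gamma_1 on R_1
    beta2 = beta + gamma * t**-3    # assumed value of beta_2 on R_2
    on_r1 = gamma1 * dv**2 / du / 2
    on_s = (gamma * dv**3 + beta * du**3) / (2 * du * dv)
    on_r2 = beta2 * du**2 / dv / 2
    return on_r1, on_s, on_r2


# Arbitrary rational sample values (illustrative only).
values = linear_element_forms(Fraction(2), Fraction(3), Fraction(1, 2), Fraction(1, 3))
assert values[0] == values[1] == values[2]
```

Note that γ₁ vanishes exactly when βdu³ + γdv³ = 0, which agrees with the remark above about Darboux tangents.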


Moreover, the contact invariant of $C_1$ with respect to the asymptotic curve v
of S at P is equal to $\gamma_1/\gamma = 1 + (\beta/\gamma)(du/dv)^3$, which has been interpreted by
Bompiani as the contact invariant of two certain conics.⁶

Obviously, the curves $C_1$ and $C_2$ have a common osculating plane at P.
According to Bompiani there are three principal points determined by this
pair of curves⁷ and in this case they are collinear. The straight line containing
them will be called the second principal line of $C_t$ at P, and the dual
line in the correspondence of Bompiani⁸ the first principal line of $C_t$ at P.
After a simple calculation we find the equations of the second principal line,
namely,
(77) 40.0 + 40.y+1=0, z=0, 


and those of the first principal line 
(78) r+ (40: + Bt )2=0, y+ (4Q2+ yt*)2—0. 
We can now prove the following 


THEOREM. The pangeodesic of a surface is characterized by the property 
that the first principal line at every point lies on the osculating plane at the 
same point. 


⁵ G. Palozzi, “Una proprietà caratteristica delle tangenti di Darboux,” Rendiconti
dei Lincei, (VI), vol. 13 (1931), pp. 483-488.

⁶ E. Bompiani, “Gli invarianti proiettivi nella teoria delle superficie, I—Ricostruzione
rapida della teoria delle applicabilità proiettive,” Rendiconti dei Lincei, (VI),
vol. 24 (1936), pp. 323-332.

⁷ E. Bompiani, “Invarianti d'intersezione di due curve sghembe,” Rendiconti dei
Lincei, (VI), vol. 14 (1931), pp. 456-461.

⁸ Cf. Buchin Su, “On the intersection of two curves in space,” Tôhoku Math. Journ.,
vol. 39 (1934), pp. 226-232.




As a curve on a surface in metric space is a geodesic when and only when
the osculating plane passes through the surface normal at the same point, the
above definition for pangeodesics furnishes a generalization in projective
geometry.

Moreover we have the following theorem:

When the curve $C_t$ varies but remains tangent to a given tangent direction
t, the second principal line always passes through a fixed point. This point
lies on t when and only when t is a tangent of Segre, and then the polar line
of P with respect to the triangle formed by three points corresponding to three
tangents of Segre is the second projective normal. Dually, the locus of the
first principal line is a plane. This plane passes through t when and only
when t is a Segre tangent and then the polar line of the tangent plane $\pi$ with
respect to the trihedron formed by three planes corresponding to three tangents
of Segre is the first projective normal.
The second order of a curve of Segre satisfies the following differential
equations

P. = (yu/y — Bu/B) + — Br/B)), wi (y/B), (t= 1, 2, 3)

where $\omega^3 = 1$ and $\omega \neq 1$. By using these equations we prove the following
result:

The first principal lines of three Segre curves through P form a trihedron
with respect to which the polar line of the tangent plane of S at P is the first
canonical line c(− 4).⁹
Consider the osculating linear complex $l_1$ of $C_1$ at P whose equation is
ly ee + Pi = 0), 
where the quantities $(p'_{ij})$ are the Plücker coördinates of a line referred to
the coördinate system of $R_1$ at P. Making use of the relations (72) we have
(79) P 12 = Pi2— Bt Pos — Btprs, Piz Bt pes = D2. Bt psa; 
Pits P 23 = Pos, = psa. 
⁹ For another interpretation of this line cf. my note “On the quadrics of Moutard,”
appearing in Univ. Nac. Tucumán Revista, A, vol. 2 (1941), pp. 67-77.

By means of (79) the equation of $l_1$ is reducible to


Bt? pos — pss + (P: — Bt) pis = 0. 


(80) 
Interchanging the quantities β and γ; u and v; t and $t^{-1}$; $P_1$ and $P_2$ and the
indices 1, 2 gives the equation of the asymptotic osculating linear complex
$l_2$ of $C_2$ at P, namely,
(81) Pr2 + yt + pss — (P2— pos = 0. 
By means of (80) and (81) we find readily the second directrix of $l_1$ and $l_2$
(82) + 3(P2— yt? 2—=0, 
and the first directrix 
(83) + Bt)z—0, y+3(P:— Bt +yt*)2—0. 


From this equation it follows that the osculating plane of a curve $C_t$ passes
through the first directrix at the same point when and only when the curve $C_t$
is a pangeodesic of S. Moreover, the theorem mentioned above is valid if the
first principal line is taken in place of the first directrix.

Let us now consider a pencil formed by the first principal line and the
first directrix at P. We shall call such a pencil the first associate pencil of $C_t$
at P and its plane the associate plane. In a similar way we define in the
tangent plane the second associate pencil and the associate point of $C_t$.

A ray $l_\rho$ in the first associate pencil may be represented by the equations


[1/(1+ p)][4Q: + Be? + (0/2) (P2— yt + Bt?) =0, 
Ly t+ [1/(1 +p) + yt? + (0/2) (P:— Bl -+ = 0, 


(84)

which is determined by the directrix $l_\infty$, the principal line $l_0$ and one value
of a certain double ratio. Similarly we have a ray $m_\rho$ in the second associate
pencil


(85) [1/(1 + p) + (p/2) (P: — Bt— yt*) 
+ [1/(1 + p) + (0/2) (P2— yt* — pt?) Jy +1=0, z=0. 


From the preceding discussion it is readily seen that the associate plane
of a curve $C_t$ on the surface becomes the osculating plane when and only when
$C_t$ is a pangeodesic.

There are quadrics of a pencil each of which has at P a contact of at
least the fourth order both with $C_1$ and $C_2$:
(86) — 32 — (4Q2 — yt-*) wz — (4Q. — BE?) + ke? =0, 


where k denotes an arbitrary parameter.


It may easily be proved that the projectivity P formed by the product
of the null systems of the asymptotic osculating linear complexes of $C_1$ and $C_2$
at P is an involution. There exist quadrics of two pencils each of which
passes through the asymptotic tangents and remains invariant under the
projectivity P. One of these pencils may be represented by


(87) ty yt*rz + Bt yz + kz? = 0, 
and the other by 


yt")yz + kz? =0. 


(88) ry +2z2+ (P,— Bt)az+ 


where k denotes a parameter.

We can prove that each regulus of any quadric in the pencil (87) or (88)
belongs to one of the two osculating linear complexes $l_1$ and $l_2$. The common
polar lines of (87) and (88) are directrices of $C_t$ and those of the pencils
(86) and (87) are the principal lines. Furthermore the only common polar
lines of (86) and (88) in the tangent plane belong precisely to the second
associate pencil.

We further remark that a ray of the second associate pencil can be defined
by means of the neighborhood of the 5-th order of both the curves $C_1$ and
$C_2$ at P.


3. New covariant curves on a surface. Having thus discussed the
various properties of the directrix and the principal line of a curve on a
surface as well as the advantage of introducing them for the definition of
pangeodesics, we come to a more general consideration of the associate pencil.
In the following lines we shall see that this pencil of lines plays an important
role in the study of certain covariant curves of a surface.

The polar line of $m_\rho$ in the fundamental polarity is a line whose equations
are

x + [1/(1 + p)][4Q: + (p/2) (P2— yt* — Bt?) Jz = 0, 
y + [1/(1 + p) + (p/2) (Pi — Bt— Jz = 0. 


This line lies in the osculating plane to the curve $C_t$ at P when and only when
$C_t$ satisfies the following differential equation


— [2t2/(1 + p) + (p/2) — Bt — ] 
+ [2t/(1 +'p) + (/2) (P2 — yt* — Bt?) 




After a simple reduction we can write the above differential equation in
the form

(89) = — Bot?) + — But") 
+ [(1+ p)/(%p —1)] — 7’), 


which defines a system of curves $C_\rho$ called the deformable curves. From this
equation, we infer that the Darboux tangents are singular directions for the
curves of this system and that only the tangent of Segre has the property that
the osculating plane of a curve on S touching this tangent at P must pass
through the polar line of $m_\rho$ with respect to the quadric of Lie. For ρ = −1
the corresponding curve $C_{-1}$ is a pangeodesic. If the tangent t varies at P the
osculating plane to the corresponding deformable curve $C_\rho$ at this point envelops
a cone $K_\rho$ of class 6 and, in particular, $K_{-1}$ is the cone of Segre.

When a one-to-one correspondence of any kind is established between the
points of two surfaces, either surface may be said to be represented on the
other. Following this definition and noticing that the asymptotic tangents
are singular directions for any deformable curve $C_\rho$ other than pangeodesics
(i.e. ρ ≠ −1), we conclude immediately that if two surfaces S and $\bar{S}$ be
representable upon each other and if all the deformable curves of two surfaces,
other than the pangeodesics, of the same kind be in correspondence, then
all the asymptotic curves of S and $\bar{S}$ must be also in correspondence, and the
surfaces are then projectively applicable. In fact, we can take on S and $\bar{S}$
the same asymptotic parameters (u, v) and the same system of reference we
have so far used. Let $\bar\beta$, $\bar\gamma$ correspond on $\bar{S}$ to β, γ; then the deformable curve
$C_\rho$ (ρ ≠ −1) of $\bar{S}$ may be represented by the following equation
(7 + Bt )t’ = (Fu — Bot?) — (Fo — Bul*) + [(1 + p)/(2p— 1) (BE 
which defines, by hypothesis, $C_\rho$ (ρ ≠ −1). Hence we have
B=B, 
Thus we have proved the 


THEOREM. A necessary and sufficient condition that two surfaces should
be projectively applicable is that the deformable curves, other than pangeodesics,
of the same kind on these surfaces be in correspondence.

We now derive a new class of covariant curves on a surface so as to
characterize the surface under collineations.

The principal quadric of a surface may be represented by
ry -— 32 — — Syoyz + kz? = 0, 


where $\phi_2$, $\psi_2$ are given by (75) and k denotes an arbitrary constant. The
polar line of $m_\rho$ with respect to this quadric is


a+ {[3/(1 + p)][4Q: + (p/2) (P2 — yt"? — Bt?) ] — = 0, 
y + {[8/(1 + p) + (p/2) (P1 — Bt — yt*) ] — $¢2}2 = 0. 


This line lies in the osculating plane to the curve when and only when 


(90) [2(5p— 4)/(1— 2p) ](y + BE) = 3t(Bul* — yr) + (Brot? — ya) 
+ [(1 + p)/(1— 2p) ] + — 210, + tyre — + — 4y) 
or 
(90’) 2(5p—4) (y+ Be ) + Out? — 6,t) 
= (4— dp) Boot?) + (1 + p) (Bat? — 
+ 61° (1 — 2p) (Bit? — + 4(1 + p) — 


In Fubini's coördinates we have


(907) 2(5p— 4) (y+ Be) [UV + (A log By/du) t? — (0 log By/dv) t] 
= [{(4— Sp)y + (7 — 11p) Bt* |t(0 log B*y/dv) 
— [ (4 — 5p) Bt + (7 — 1p) y] 2 (0 log By?/du) + 4(1 + p) 


In particular, when ρ = 4/5, the corresponding curves are given by the following
equation in Fubini's coördinates
y(@ log By*/du)t? —- B(0 log B?y/dv)t* 4(B7t® — y*) = 0, 


which is independent of the second order of these curves. 

A curve $L_\rho$ on the surface defined by the differential equation (90) will
be called a projective curve. It is remarkable that the curves $L_{-1}$ are pangeodesics.
The osculating planes to all these curves $L_\rho$ (for fixed ρ) through the
point P envelop a cone $N_\rho$ of class 6 and, in particular, $N_{-1}$ is the cone of
Segre.

With the aid of the cone $N_\rho$ (ρ ≠ −1) we may define some new canonical
lines.

In a way similar to that we have used for the deformable curves we get
immediately the


THEOREM. A necessary and sufficient condition that two surfaces should
be projectively equivalent is that the projective curves of the same kind, other
than the pangeodesics, on them be in correspondence.


We shall discuss the deformable curves and projective curves of a surface
on another occasion.


4. The correspondence of Segre. Let us define a certain correspondence
by means of the curved asymptotic lines $C_1$ and $C_2$ of the ruled surfaces $R_1$
and $R_2$.¹⁰ It is easy to show that the correspondence in question is the polarity
with respect to the following pencil of quadrics
(91) ry + + kz? = 0, 


where k denotes an arbitrary constant. In particular we have the


THEOREM. The point-plane correspondence of Bompiani defined by the
pair of curves $C_1$ and $C_2$ at P coincides with that of Segre when the point is
taken on the tangent of $C_t$.


Concerning the pencil of quadrics (91) we can also establish the following 


THEOREM. If the lines of the regulus on a quadric of the pencil (91)
intersect the asymptotic tangent v (u), then it must belong to the osculating
linear complex of $C_1$ ($C_2$) at P, and conversely.¹¹


Furthermore we can prove the


THEOREM. Any quadric of the pencil (91) has a double line contact with
both the asymptotic ruled surfaces $R_1$ and $R_2$ at P.


5. Some covariant quadrics. In a following paper we propose to discuss
the geometry of the sequence of asymptotic ruled surfaces $R_1^{(1)}$ ($R_2^{(1)}$) of
$R_1$ ($R_2$), $R_1^{(2)}$ ($R_2^{(2)}$) of $R_1^{(1)}$ ($R_2^{(1)}$), ···, $R_1^{(n)}$ ($R_2^{(n)}$) of $R_1^{(n-1)}$ ($R_2^{(n-1)}$),
etc. associated with the curve $C_t$. We shall call $R_1^{(n)}$ ($R_2^{(n)}$) the n-th derived
asymptotic ruled surface of $R_1$ ($R_2$). A full discussion for the asymptotic
chord surfaces will also be given later. Here we give only some covariant
quadrics of $R_1$ ($R_2$) associated with the curve $C_t$.

One of the two asymptotic osculating quadrics of the surface $R_1$ associated
with $C_t$ at P coincides with the osculating quadric of the surface $R_1$ and the
other one may be found by our preceding method, namely,


¹⁰ Cf. Buchin Su, loc. cit.

¹¹ Suppose that two curves $C_1$ and $C_2$ intersect at P with the same osculating plane
but distinct tangents. If the principal points are collinear, then this theorem is also
valid.


219 — 21) + — 2yil?yi2, 
+ [— uv + {t’; —O,y)t + 614) t?} Jay? = 0. 


Making use of the transformations (71), (72) and (73) and reducing, we have
the equation of this quadric:


2t* ( ty — 2) + 2ytxz 
+ [— (Ou + By) +y{t'- 


(ye/y — Or) t —-(2yu/ /y + 6,)t° == (), 


which is precisely the osculating quadric of the surface $R_2$ at the point P.
Hence we have the


THEOREM. The two asymptotic osculating quadrics of $R_1$ ($R_2$) associated
with the curve $C_t$ coincide with the osculating quadrics of $R_1$ ($R_2$) and
$R_2$ ($R_1$) at P.


At a point P of a surface we can also define two quadrics called the
asymptotic chord quadrics, as we have obtained the equations in our system
of reference in Part A of this paper. These quadrics intersect in the asymptotic
tangents and a conic, whose plane α is found to be


(Gye? + — (dyt-* + 
A (yy — t-? + (yu ) 4 (Bu — 
» + —4(yt-* + B)t’}z = 0. 


If we vary the curve $C_t$ but keep it tangent to the tangent t, the locus of the
line of intersection of the plane α and the osculating plane $\pi_t$ of $C_t$ at P is a
plane β. In particular, for each tangent of Segre at P the plane β always
passes through the new canonical line c(−2/5).

It remains to determine the equation of the asymptotic chord quadrics of
the surface $R_1$ ($R_2$) at P. It is evident that one of these quadrics coincides
with the osculating quadric of $R_1$ and the other may be represented by the
following equation:


4 {Oruv + 4 (yiv 10) + (yiu + 21° 0, 


By means of the method we have so far used the required equation is found to be 


(92) zy —z + — (5pt — 
+ — (yt? + + + Bet 
— $B Out? + + + (yu + 0. 




It is readily seen that this quadric is independent of the second order of
$C_t$ when and only when the curve $C_t$ touches a Darboux tangent at P and the
three corresponding quadrics have a common point on the following line:


r+ 4yz2—0. 


By a suitable substitution we get the equation of the corresponding quadric
of the surface $R_2$. These two chord quadrics have a residue conic of intersection
lying on the plane γ. We can prove that γ coincides with the osculating
plane of $C_t$ when and only when this curve $C_t$ is a pangeodesic of the surface.

If the curve $C_t$ varies but remains tangent to a tangent t at P, the locus
of this residue conic is a quadric
(93) 36 (ay —z— 4Oyvz®) + 12( Bt? — (y + 4Ouz)z 

+ 12(yt-? — 2Bt) (x + 46,2)2 

— + yt)? + 3 (Bul? + 4Bot + + = 0, 
which belongs to a Moutard pencil.¹² For each tangent of Darboux the corresponding
quadric becomes the Moutard quadric.


THE NATIONAL UNIVERSITY OF CHEKIANG,
KWEICHOW, CHINA.


12 The Moutard pencil has first been defined by Buchin Su and Asajiro Ichida. 
See their paper: “On certain cones connected with a surface in the affine space,” 
Japanese Journal of Mathematics, vol. 10 (1934), pp. 209-216. For another definition 
of this pencil compare my paper: “On the quadrics of Moutard,” loc. cit. 



A GENERALIZATION OF ASSOCIATE QUADRICS OF A SURFACE.* 


By CHENKUO Pa. 


At a generic point of an analytic non-ruled surface, a quadric called the
associate quadric has been defined by B. Su.¹ The asymptotic curves of a
surface belong to linear complexes if, and only if, the associate quadric of the
surface is fixed.² The equivalence of this quadric and the second quadric $\phi$
in the Godeaux sequence has been pointed out by L. Godeaux.³ S. Finikoff has
given a quite different definition of the same quadric.⁴ The object of this
note is to generalize the notion of the associate quadric of a surface from the
point of view of Finikoff.


1. For the subsequent discussion it is convenient to utilize the normal
coördinate system of Cartan⁵ at a given point M of a surface. Let
$\{M M_1 M_2 M_3\}$ be such a tetrahedron of reference; then the conditions of
immovability of a line $(p_{ij})$⁶ are easily found to be


= A*Bpoy + K (pros prs) 


Op12/0v = — ypis —- (0 log B/Ov) pro + A? (pros — prs) + B’y pos, 
= —- Bpy2 — (0 log y/du) pis — B* (prs + pos) + 
Op13/0v = B’yps4 — (pos +- Pas); 
= — pig -— B* pos — K pa, 
0714/00 = — pro — A* — K pias, 
Ope; /0u = —- Piz — B?piog + 


= pro + A* ps4 — K pos, 


* Received June 8, 1942. 

¹ Buchin Su, “On the surfaces whose asymptotic curves belong to linear complexes II,”
Tôhoku Math. Journ., vol. 40 (1935), pp. 433-448.

² Buchin Su, loc. cit.

³ L. Godeaux, “Remarques sur les quadriques associées aux points d'une surface,”
Journ. Chinese Math. Soc., vol. 2 (1937), pp. 1-5.

⁴ S. Finikoff, “Sur les quadriques de Lie et les congruences de M. Demoulin,”
Recueil Math. de Moscou, vol. 37 (1930), pp. 48-97.

⁵ Cf. S. Finikoff, Comptes Rendus (1933), pp. 883-885; Buchin Su, Tôhoku Math.
Journ., vol. 41 (1935), pp. 203-215.

⁶ We denote the Plücker coördinates of a line joining two points (y) and (z) by
$p_{ij} = y_i z_j - y_j z_i$ (i, j = 1, ···, 4), and (y), (z), $(p_{ij})$ all refer to the local
coördinates.


= — prs — pros + (0 log y/0U) pros, 
Opss/0u = — Bpos, 
Opss/00 = — Pra + Pos + (0 log B/Ov) pss, 


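where the quantities K and $\bar{K}$ are defined by


$$2K = \beta\gamma - \partial^2\log\beta/\partial u\,\partial v, \qquad 2\bar{K} = \beta\gamma - \partial^2\log\gamma/\partial u\,\partial v.$$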
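The Plücker coördinates $p_{ij}$ that enter the conditions of immovability can be illustrated numerically. The sketch below assumes the usual convention $p_{ij} = y_i z_j - y_j z_i$ for the line joining (y) and (z), and checks the classical quadratic relation $p_{12}p_{34} - p_{13}p_{24} + p_{14}p_{23} = 0$ satisfied by the coördinates of every line; the function names and the sample points are our own.

```python
from itertools import combinations


def plucker(y, z):
    """Pluecker coordinates p_ij = y_i*z_j - y_j*z_i for 1 <= i < j <= 4."""
    return {(i + 1, j + 1): y[i] * z[j] - y[j] * z[i]
            for i, j in combinations(range(4), 2)}


def plucker_relation(p):
    """Left-hand side of p12*p34 - p13*p24 + p14*p23 = 0."""
    return p[1, 2] * p[3, 4] - p[1, 3] * p[2, 4] + p[1, 4] * p[2, 3]


# Two arbitrary points in homogeneous local coordinates (illustrative values).
p = plucker((1, 2, 3, 4), (0, 1, -1, 2))
assert plucker_relation(p) == 0
```

A linear complex is then a single linear equation in the six quantities $p_{ij}$, which is the form taken by the complexes considered below.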


The conditions of integrability in this coördinate system take the form


0A*/du = K (0 log KB/dv), 0B*/dv = K(0 log Ky/du), 
A[@(AB) /ov] = By) fou]. 


For the sake of convenience we put 
4N = By) = A[0(AB) /dv], 


so that the projective minimum surface is characterized by N = 0. 
The equations of the asymptotic osculating linear complexes $R_1$ and $R_2$
of the asymptotic curves u and v at the point M are found to be


R, = pis — Pos = O, Re = prs + pos = 0 
respectively. 
Making use of the conditions of immovability, we have 


OR, /0v = — 2(pre + OR, /Ou = — 2K pas, 
0°R, /dudv = — 2K (0 log BA /0v) pss = — 2(0A*/0u) psa, 
#R,/du? = -— 2[ — (0 log y/0v) — yprs + log BA?/0v) pss], 


@R, /dv? = 2[BK po, — (0K /0u) pas]. 


2. Let C be a curve on the surface passing through the point M. At M
and two consecutive points M', M'' on C there are asymptotic osculating linear
complexes $R_1$, $R_1'$, $R_1''$, which have a regulus in common. We shall call this
regulus the u-regulus of C at M. In a similar way the v-regulus of C at M
is defined. Suppose that all the u-reguli of the curves touching a given tangent
t of S at M have a straight line in common, then we call this line the u-characteristic
line of S in the direction t and we define similarly the v-characteristic
line. If a u-characteristic line is also a v-characteristic line in the same
tangent t, this line is then called a characteristic line of S in the direction t.

By hypothesis, the u-regulus of C consists of the common lines of the
three linear complexes

$$R_1 = 0, \qquad dR_1 = 0, \qquad d^2R_1 = 0,$$

where d denotes the differential along C. Making use of the calculations given
in 1 we may rewrite these equations in the form


Pis-— Pos = 0, 
Pro + [A? + K(du/dv) ] ps4 = 0, 
[ B° — (B/y) K (du/dv)*| pos — pis + Lpss = 0, 


where we have set 
N/2By + K/y O(log B°K?/dv) du/dv + 1/y (0K /du) (du/dv)? + (K/y) d*u/dv. 


In consequence, the quadric $B_1$ containing the u-regulus of C is easily found
to be


— |B? — (B/y) K (du/dv)*|y2” — [A? + K (du/dv) ]y;? 
+ + K(du/dv) — (B/y)K (du/dv)*]y? + L(ysys — y2ys) = 9, 


where $y_1$, $y_2$, $y_3$, $y_4$ denote the local coördinates of a point P with respect to
the tetrahedron of Cartan, viz.,

$$P = y_1 M + y_2 M_1 + y_3 M_2 + y_4 M_3.$$


Similarly, we find the equation of the quadric $B_2$ which contains the v-regulus
of C at M:


yi” — [A? — (y/B) K (dv/du)*]|y,? — [B? + K(dv/du) 
+ [B? + K(dv/du) |[A* — (y/B)K (dv/du)*?]y2 + — yoys) = 0, 


where we have set 


M = N/2By + K/B O(log y’K*/du) dv/du 
+ 1/B (0K /dv) (dv/du)* — (K/B) (dv/du)* (d*u/dv?). 


In particular when C is an asymptotic curve v of the surface the quadric $B_1$
coincides with the associate quadric of Su at the given point.

If all the asymptotic curves v (u = const.) belong to linear complexes, so that
K = 0, then all the quadrics $B_1$ (independent of the curves C) coincide with
the associate quadric, and in this case the associate quadric is stationary along
any asymptotic curve v. In fact, taking account of the equations


Ry = pis — Pros = 9, OR, /0v = + A* pss = 0, 
07R, = pos (N/2By) pss = 0, 




and K = 0, we have 
PR, /dv* (B?/B) Kpss 0, 


whence the result follows. Conversely, if the associate quadric is stationary
along any asymptotic curve v, then all the asymptotic curves v must necessarily
belong to linear complexes. This result may be demonstrated directly by
means of the calculation we have so far used or derived from a theorem which
we have recently established.⁷ Thus a theorem of Su⁸ has been improved
into the following form:


If all the asymptotic curves v of a surface belong to linear complexes, then
all the flecnode tangents of each curve v lie on one and the same quadric
(depending upon u alone), and conversely.


In particular if the asymptotic curves of both families belong to linear complexes,
all the flecnode tangents of the surface must then lie on a fixed quadric.

Let us now consider the general case. For a given tangent t and various
curves touching t at M the curve of intersection of $B_1$ and $B_2$ always lies on a
quadric B:


yi” — — + (A?B?— KK) + (N/2By +1) — yoys) = 9, 
where 


P = (KKdudv/(Kydv* + KBdu’) ) 
[(d log B°K?K /dv) dv + (0 log /du) du). 


It should be noted that the quadrics B obtained by varying t always pass 


through a curve given by the equations 


9243 = 0, 
( — B’y.? — + (A*B? — KK) y,? = 0. 


For each curve defined by the differential equation of the second order 
L? — 4(A? + Kdu/dv) [ B* — (B/y) K (du/dv)*?] = 0, 


the quadric B, decomposes into two planes. And for each curve defined by the 
differential equation 


*Cf. Chenkuo Pa, “The projective theory of surfaces in ruled space,” American 
Journal of Mathematics, vol. 65 (1943), pp. 712-736. See especially the last paragraph. 
§ Buchin Su, loc. cit. 


L = 0, 


the tetrahedron of Cartan at M is self-polar with respect to the corresponding 
quadric B,. If a curve of one of the above classes be tangent to the curve v 
at M and has the contact invariant 1—h (i.e. du/dv =0, d?u/dv? = yh) 
then we have 

(N/2By — Kh)* — 4A*B? = 0, 
or 


(N/2By) —hK =0. 


In the case h = 0, surfaces of the first class have been first found by Su,° 
and those of the second class are precisely the projective minimum surfaces 
which may be geometrically defined by means of associate quadrics, as was 


shown by Su.?° 


3. If we consider all the curves on the surface touching a fixed tangent t 
at M, the corresponding quadrics B, all pass through a quadrilateral $\Sigma_1$, which 
may be represented by the equations: 


— = — |B? — (B/y) K (du/dv)*]y.* = 0; 
— = 9, yi? — (A? + Kdu/dv)y;? = 0. 


A similar result holds for the quadrics B,. Thus we arrive at the following 


THEOREM. At a generic point M of a surface S the u-characteristic lines 
(v-characteristic lines) in a given tangent direction t form a quadrilateral 
$\Sigma_1$ ($\Sigma_2$) on the quadric of Lie. For each asymptotic tangent one of these 
quadrilaterals decomposes into the asymptotic tangents and the other coincides 
with the quadrilateral of Demoulin. They coincide with each other only when 


t lies in one of the directions 
BKdu' + yKdv> = 0 
and then the quadric B becomes the quadric of Lie and vice versa. 


For an isothermally asymptotic surface the directions just defined are 
those of Darboux. 

For the direction A*dv + Kdu 0 two sides of 3, are the asymptotic 
tangents of the surface at M, and the remaining sides intersect the asymptotic 


® Buchin Su, loc. cit. 
7° Buchin Su, “Some characteristic properties of projective minimal surfaces,” 
Science Reports of Téhoku Imp, Univ. (A), vol. 24 (1936), pp. 595-600. 




u-tangent. A similar result holds for the direction B?du-+ Kdv=0. These 
two directions coincide with each other when and only when the surface 
satisfies the relation A*B? — KK = 0. 


4. In particular, if the curve C is taken in the asymptotic direction v 
so as to possess the contact invariant 1—h with the asymptotic curve v at M 
and is denoted by Cy", then the quadric B, becomes a quadric of Darboux. 
The equation is easily found to be 


— — (K/h) = 0 
This may be rewritten in Fubini’s normal coérdinates as follows: 
cy —2t + [| (K —h(9w + By))/2h]z? =0. 
Hence we have the following theorem: . 


The quadric containing the v-regulus of a curve Cy" is a Darboux quadric 
with index k =K/Byh. When the asymptotic curve v belongs to a linear 
complex, the corresponding quadric for any curve Cy* becomes the quadric 
of Lie. 

It seems of some interest to give here a new characteristic property of the 
quadric of Lie: 

The two consecutive asymptotic osculating linear complexes R, and f, 


(R. and R’,) have always one and only one common regulus (for any curve C) 


on the quadric of Lie. 


5. We shall conclude this paper by a remark on certain new invariant 
quadrics. The osculating linear complex /?, and the two consecutive osculating 
linear complexes P, and R’, along a curve C have a common regulus on the 


quadric W;: 
(A? + Kdu/dv) ysys = 9. 


Similarly we obtain another quadric W2: 


Y1¥3 — (B? + Kdv/du) yoy, = 0. 


For any curve C, both quadrics always pass through the two directrices of 


Wilczynski and two lines $l_\epsilon$ ($\epsilon = \pm 1$) given by 




( V Rk? + Kdv/du Y2—e \V A? Kdu/dv ¥3 = 0, 
| VB? + Rdv/du V A? + Kdu/dv y, =0. 


The locus of the lines $l_\epsilon$ ($\epsilon = \pm 1$) for all the curves C through M is an 
algebraic ruled surface of order 4: 


(ysy2— A*ysys) — Bysys) — = 0. 


For the surface A*B* —- KK = 0 quoted above this ruled surface decomposes 
into a plane and a cubic surface. In this case each of the quadrics W, and W2 
decomposes into planes for any curve C of the direction A?dv + Kdu=0. 


THE NATIONAL UNIVERSITY OF CHEKIANG, 
KWEICHOW, CHINA. 


GENERALIZATION OF WARING’S PROBLEM TO ALGEBRAIC 
NUMBER FIELDS.* 


By CARL LUDWIG SIEGEL. 


1. Introduction. Waring’s problem consists in finding, for any fixed 
rational integer r > 1, a number m such that all positive rational integers $\nu$ 
may be decomposed into m perfect r-th powers of non-negative rational 
integers. The first solution was given by Hilbert.¹ 

Using their powerful circle method, Hardy and Littlewood ² obtained a 
still deeper result containing Hilbert’s theorem: They proved that the number 
$A(\nu)$ of positive rational integral solutions $\lambda_k$ ($k = 1, \dots, m$) of the equation 

(1) $\lambda_1^r + \cdots + \lambda_m^r = \nu$ 

has exactly the order of magnitude $\nu^{m/r-1}$, for any fixed $m \ge m_0(r)$ and 
$\nu \to \infty$, namely 

$A(\nu) \sim \sigma \, \dfrac{\Gamma^m(1 + 1/r)}{\Gamma(m/r)} \, \nu^{m/r-1}$, 

where $\sigma$, the singular series, is a function of $\nu$ lying between finite positive 
bounds. 

Some years later, I ³ was interested in generalizing the circle method to 
an arbitrary algebraic number field K of degree n. Trying to solve the 
analogue of Waring’s problem for K, I succeeded only in dealing with the simplest 
case, the decomposition into square numbers; in the case of an exponent 
r > 2, however, the generalization of the major and minor arcs of the Farey 
dissection led to a difficulty which I could not overcome at that time. Recently 
I found the solution. 


* Received March 2, 1943. 

¹ D. Hilbert, “Beweis für die Darstellbarkeit der ganzen Zahlen durch eine feste 
Anzahl nter Potenzen (Waringsches Problem),” Mathematische Annalen, vol. 67 (1909), 
pp. 281-300. 

² G. H. Hardy and J. E. Littlewood, “A new solution of Waring’s problem,” The 
Quarterly Journal of Mathematics, vol. 48 (1919), pp. 272-293; G. H. Hardy and J. E. 
Littlewood, “Some problems of ‘Partitio Numerorum’; I: A new solution of Waring’s 
problem,” Nachrichten von der Königlichen Gesellschaft der Wissenschaften zu 
Göttingen, Mathematisch-physikalische Klasse, (1920), pp. 33-54. 

³ C. L. Siegel, “Additive Zahlentheorie in Zahlkörpern,” Jahresbericht der Deutschen 
Mathematiker-Vereinigung, vol. 31 (1922), pp. 22-26; C. L. Siegel, “Additive Theorie 
der Zahlkörper. I, II,” Mathematische Annalen, vol. 87 (1922), pp. 1-35 and vol. 88 
(1923), pp. 184-210. 




Let $K^{(1)}, \dots, K^{(n)}$ be the n conjugate fields, $K^{(l)}$ ($l = 1, \dots, n_1$) being 
real and $K^{(l)}$, $K^{(l+n_2)}$ ($l = n_1 + 1, \dots, n_1 + n_2$) conjugate complex, 
$n_1 + 2n_2 = n$. A number $\nu$ of K is called totally positive, if $\nu^{(l)} > 0$ 
($l = 1, \dots, n_1$). Since the number of totally positive integral solutions 
$\lambda_1, \dots, \lambda_m$ of (1) in K is not necessarily finite, if $n_2 > 0$, we restrict these 
solutions by the further conditions $|\lambda_k^{(l)}|^r \le |\nu^{(l)}|$ ($k = 1, \dots, m$; 
$l = n_1 + 1, \dots, n_1 + n_2$) and denote their number by $A(\nu)$. 


In the case of the field of rational numbers, it is trivial that any positive 
integer $\nu$ is a sum of r-th powers of integers, namely $\nu$ times $1^r$. It is easily 
seen that the corresponding statement, without further restriction, does not 
hold for an arbitrary K: In a real quadratic field with discriminant 4d, 
$d \equiv 2, 3 \pmod 4$, all integral squares have the form 
$(a + b\sqrt d)^2 = a^2 + b^2 d + 2ab\sqrt d$ with rational integral a, b; consequently a number $p + q\sqrt d$ with 
rational integral p, q and odd q is never a sum of integral squares. This 
example leads to the introduction of the ring $J_r$ generated by the r-th powers 
of all integers in K; it consists of all numbers $g_1\lambda_1^r + \cdots + g_h\lambda_h^r$ 
($h = 1, 2, \dots$), where $\lambda_1, \dots, \lambda_h$ are integers in K and $g_1, \dots, g_h$ are 
rational integers. Obviously $A(\nu) = 0$, if $\nu$ is not a number of $J_r$. It will be 
proved that $J_r$ is an order, and an explicit construction of $J_r$ will be given. 
In the above example, $J_2$ consists of all numbers $p + q\sqrt d$ with even q. 
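
The parity obstruction in this example can be checked numerically. The following is only a small sketch, not part of the original argument: the function names `square_in_quadratic_order` and `sqrt_coefficients_of_squares` are hypothetical, and the search is restricted to a finite box of coefficients a, b.

```python
from itertools import product


def square_in_quadratic_order(a: int, b: int, d: int) -> tuple[int, int]:
    """Return (p, q) with (a + b*sqrt(d))**2 = p + q*sqrt(d)."""
    return a * a + b * b * d, 2 * a * b


def sqrt_coefficients_of_squares(d: int, bound: int) -> set[int]:
    """Collect the sqrt(d)-coefficients of all squares with |a|, |b| <= bound."""
    return {
        square_in_quadratic_order(a, b, d)[1]
        for a, b in product(range(-bound, bound + 1), repeat=2)
    }


if __name__ == "__main__":
    # d = 2, 3, 6, 7 are all congruent to 2 or 3 modulo 4.
    for d in (2, 3, 6, 7):
        coeffs = sqrt_coefficients_of_squares(d, bound=10)
        print(d, all(q % 2 == 0 for q in coeffs))
```

Since every square contributes an even coefficient of $\sqrt d$, so does every sum of squares, which is the reason a number with odd q cannot be represented.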

Let D be the absolute value of the discriminant of K, and denote by 
N(v) = M the norm of the totally positive integer v in K. 


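THEOREM. For any fixed 

(2) $m > (2^{r-1} + n)nr$ 

and $M \to \infty$, 

(3) $A(\nu) = \sigma_0 \sigma D^{(1-m)/2} M^{m/r-1} + o(M^{m/r-1})$, 

where $\sigma_0$ is a positive number depending only upon $n_1$, $n_2$, m, r and, in 
particular, 

(4) $\sigma_0 = \left( \dfrac{\Gamma^m(1 + 1/r)}{\Gamma(m/r)} \right)^n$ ($n_2 = 0$). 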


The singular series $\sigma$ lies between finite positive bounds, whenever $\nu$ belongs 
to $J_r$, and $\sigma = 0$ otherwise. 

As a consequence of this Theorem, all totally positive numbers of $J_r$ with 
sufficiently large norm are sums of a bounded number of r-th powers of totally 
positive integers in K. It might be suggested, in analogy to the case n = 1, 
that then all totally positive numbers of $J_r$ will be such sums; however, the 


following example shows that this is not true without further restriction. 
In the quadratic field with discriminant 24, the totally positive number 
$5 + 2\sqrt 6$ of $J_2$ cannot be expressed as a sum of integral squares. On the 
other hand, it can be proved that all totally positive numbers of $J_r$ are a sum 
of a bounded number of integral r-th powers, if the field K is not totally real; 
but this condition is not necessary. 
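
The counterexample can be confirmed by a finite search, because every non-zero square $(a + b\sqrt 6)^2$ adds at least 1 to the rational part of a sum. The sketch below is illustrative only; the helper names `is_totally_positive` and `is_sum_of_integral_squares` are hypothetical, and the code assumes integers of the form $a + b\sqrt d$ with $d \equiv 2, 3 \pmod 4$ and a non-negative rational part p.

```python
from math import isqrt


def is_totally_positive(p: int, q: int, d: int) -> bool:
    """True when both conjugates p + q*sqrt(d) and p - q*sqrt(d) are positive."""
    return p > 0 and p * p > d * q * q


def is_sum_of_integral_squares(p: int, q: int, d: int) -> bool:
    """Decide whether p + q*sqrt(d) is a finite sum of squares (a + b*sqrt(d))**2
    with rational integers a, b. Assumes p >= 0 and d > 0."""
    # All non-zero squares whose rational part a^2 + d*b^2 does not exceed p.
    squares = set()
    for a in range(-isqrt(p), isqrt(p) + 1):
        for b in range(-isqrt(p // d), isqrt(p // d) + 1):
            rational = a * a + d * b * b
            if 0 < rational <= p:
                squares.add((rational, 2 * a * b))
    # Explore every partial sum; rational parts strictly increase, so this ends.
    reachable = {(0, 0)}
    frontier = [(0, 0)]
    while frontier:
        part, coeff = frontier.pop()
        for sp, sq in squares:
            nxt = (part + sp, coeff + sq)
            if nxt[0] <= p and nxt not in reachable:
                reachable.add(nxt)
                frontier.append(nxt)
    return (p, q) in reachable


if __name__ == "__main__":
    print(is_totally_positive(5, 2, 6))         # 5 + 2*sqrt(6)
    print(is_sum_of_integral_squares(5, 2, 6))  # the example in the text
    print(is_sum_of_integral_squares(7, 2, 6))  # (1 + sqrt(6))**2
```

For $5 + 2\sqrt 6$ the rational part 5 forces b = 0 in every square, so the coefficient of $\sqrt 6$ in any admissible sum is 0 and never 2.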

For the sake of brevity, the proof of the Theorem will be given only in 
the case of a totally real field K, i.e., $n_2 = 0$; as a matter of fact, the proof 
in the general case proceeds on the same lines, the formulae being somewhat 
more cumbersome. Probably, the reader will also notice several possible 
generalizations of the Theorem. The proof uses Vinogradow’s idea of 
substituting finite trigonometrical sums for the generating power series in the 
original method of Hardy and Littlewood, with some modifications due to 
Landau.⁴ For n = 1, the domains $B_\gamma$ introduced in Section 2 are the major 
arcs in the definition of Weyl.⁵ 

Following Dedekind, we abbreviate the function $e^{2\pi i x}$ by the symbol $1^x$. 
Henceforth, small Greek letters without upper index denote points in the 
real n-dimensional euclidean space R, the coördinates being designated by 
upper indices; e. g., $\xi = \{\xi^{(1)}, \dots, \xi^{(n)}\}$. The numbers of the totally real 
field K are represented by the points $\alpha = \{\alpha^{(1)}, \dots, \alpha^{(n)}\}$ of R, where $\alpha^{(l)}$ is 
the conjugate of $\alpha$ in $K^{(l)}$ ($l = 1, \dots, n$). We define $S(\xi) = \xi^{(1)} + \cdots + \xi^{(n)}$, 
$N(\xi) = \xi^{(1)} \cdots \xi^{(n)}$. A relationship involving small Greek letters without 
upper index, the symbols S and N excepted, stands always as an abbreviation 
of the n corresponding relationships for the coördinates; e. g., the inequality 
$\alpha < \xi$ means $\alpha^{(l)} < \xi^{(l)}$ ($l = 1, \dots, n$). 

Small German letters denote ideals in K. The symbols $N(\mathfrak a)$, $\mathfrak a \mid \mathfrak b$, $(\mathfrak a, \mathfrak b)$ 
have their usual meaning. The numbers of any ideal $\mathfrak a$ constitute a lattice 
in R and any basis of $\mathfrak a$ defines a fundamental parallelepiped in this lattice 
with the volume $D^{1/2} N(\mathfrak a)$. We choose a basis $\omega_1, \dots, \omega_n$ of the unit ideal; 
then the inverse matrix $(\rho_l^{(k)}) = (\omega_k^{(l)})^{-1}$ defines a basis $\rho_1, \dots, \rho_n$ of $\mathfrak d^{-1}$, 
where $\mathfrak d$ is the ramification ideal of K and $N(\mathfrak d) = D$; let E be the 
corresponding parallelepiped in R, with the volume $D^{-1/2}$. 

For any totally positive unit $\epsilon$ in K, the formula $A(\epsilon^r \nu) = A(\nu)$ holds 
good. Since there exist n —1 independent units in K, we may assume, during 
the proof of the Theorem, that 


⁴ E. Landau, “Über die neue Winogradoffsche Behandlung des Waringschen 
Problems,” Mathematische Zeitschrift, vol. 31 (1930), pp. 319-338. 

⁵ H. Weyl, “Bemerkung über die Hardy-Littlewoodschen Untersuchungen zum 
Waringschen Problem,” Nachrichten von der Königlichen Gesellschaft der 
Wissenschaften zu Göttingen, Mathematisch-physikalische Klasse, (1921), pp. 189-192. 




(5) O(M*), O(M-OM), 


We introduce the abbreviations 


by (2), the constant a is positive. The symbols O and o refer always to the 
passage to the limit $M \to \infty$. 
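Define 

(6) $f(\xi) = \sum_{\lambda} 1^{S(\lambda^r \xi)}$, 

where $\lambda$ runs over all integers in K satisfying $0 < \lambda < \nu^{1/r}$, and 

(7) $g(\xi) = f^m(\xi)\, 1^{-S(\nu \xi)}$. 

Since $S(\beta)$ is a rational integer for all numbers $\beta$ in $\mathfrak d^{-1}$, we have 

(8) $A(\nu) = D^{1/2} \int_E g(\xi)\, dv$, 

where $dv = d\xi^{(1)} \cdots d\xi^{(n)}$ is the volume element in R. 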
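
For the rational field (n = 1) the orthogonality behind formula (8) can be tried out numerically. The sketch below is an illustration under stated assumptions rather than the method of the paper: the integral over the unit interval is replaced by a finite average over the points j/Q, which is exact once Q exceeds every possible value of the sum of powers and also $\nu$; the summation range is taken as $\lambda^r \le \nu$; and all function names are hypothetical.

```python
import cmath
from itertools import product


def integer_root(nu: int, r: int) -> int:
    """Largest integer P with P**r <= nu."""
    P = 0
    while (P + 1) ** r <= nu:
        P += 1
    return P


def count_by_orthogonality(nu: int, r: int, m: int) -> int:
    """Count positive solutions of l_1^r + ... + l_m^r = nu by Fourier inversion."""
    P = integer_root(nu, r)
    Q = max(m * P ** r, nu) + 1  # no wrap-around modulo Q can occur

    def unit(k: int) -> complex:
        # e^{2 pi i k / Q}, with k reduced modulo Q to keep the phase small.
        return cmath.exp(2j * cmath.pi * (k % Q) / Q)

    total = 0j
    for j in range(Q):
        f = sum(unit(lam ** r * j) for lam in range(1, P + 1))  # analogue of (6)
        total += f ** m * unit(-nu * j)                          # analogue of (7)
    return round((total / Q).real)


def count_by_brute_force(nu: int, r: int, m: int) -> int:
    """Direct enumeration of the same ordered solutions."""
    P = integer_root(nu, r)
    return sum(
        1
        for lams in product(range(1, P + 1), repeat=m)
        if sum(lam ** r for lam in lams) == nu
    )


if __name__ == "__main__":
    print(count_by_orthogonality(50, 2, 3), count_by_brute_force(50, 2, 3))
```

The two counts agree because the average of $e^{2\pi i k j/Q}$ over j vanishes unless k is divisible by Q, which here forces the sum of powers to equal $\nu$.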


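2. Generalized Farey dissection. For any number $\gamma$ of K, let $\mathfrak a = \mathfrak a_\gamma$ 
be the denominator of $\gamma\mathfrak d$, i.e., $\mathfrak a = (1, \gamma\mathfrak d)^{-1}$. We define $B_\gamma$ to be the set of 
all points $\xi$ of R fulfilling the condition 

(9) $N(\mathrm{Max}(h\,|\xi - \gamma|,\ t^{-1})) \le N(\mathfrak a^{-1})$. 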


It is clear that By is vacuous in case N(a) > t” = MVr-4, 
In the following, it will be sometimes tacitly assumed that M is sufficiently 
large, i.e., $M > M_0$, where $M_0$ depends only upon K, m, r. 


Lemma 1. If y¥, then B, and Bf have no common point. 
Proof. Let é be a common point of By and By, and put 
Max(h |é—y|,¢7?) Max(h|é—7|,07) af 


then 
rst, 7St, N(aa) S N(7?) 
N ((y—7) aa) S (2th-*)" — 


On the other hand, the ideal (y— y)aad is integral and therefore 




N((y— 7)aa) = D*; 
this is a contradiction. 


LEMMA 2. Let $\xi$ be a point not lying in any $B_\gamma$. There exist an integer 
$\alpha$ in K and a number $\beta$ of $\mathfrak d^{-1}$ such that 


(10) legé-—Bl <h, 
(11) Max(h | —B|,| «|) = D+, 
(12) Max(| a |,---,| |) 
(13) N((a, BD)) < 


Proof. Applying Minkowski’s theorem to the system of 2n linear forms 
d 


n n 
21, > (Eo. 71 — pi yz) (k =1,- 


with determinant + 1, we obtain a solution a, 8 of (10) with 1|a, d°"|B. Set 
—y and (1, then ala, N(a) =| N(a)|. Since € is not a 


point of By, we have 
N(Max(h | é—y |, t-?) > N(a*), N(Max(1,¢* | @|)) >1, 
and (12) follows. 

Consider now all pairs $\alpha$, $\beta$ satisfying the conditions $1 \mid \alpha$, $\mathfrak d^{-1} \mid \beta$ and (10); 
they form a finite set ©. Choose 2,8 in © such that the number 
Max(| a |,---,!a |) attains its minimum 0; by (12), @>¢. We are 
going to demonstrate that this pair fulfills also the conditions (11) and (13). 
Put (a, and let x be a number of g. The pair 


belongs to S, whenever the conditions 

are satisfied, and then, by the definition of b, 

(15) Max(| |,---,| a |) 


If N(q) < D+, Minkowski’s theorem shows the existence of a number 
«x in g such that 0< | « | <1. Then (14) is satisfied, by (10), and (15) 
leads to a contradiction, since | @|<|a |. Consequently N(q*) < D3, and 
this is the assertion (13). 

In order to prove also (11), we may obviously assume 


(16) ja | < DA, 


whence n > 1. Applying again Minkowski’s theorem, we construct a number 


«x in q with 


(17) 0< | S Ds, == 2,---,n); 


then | |< Land | < | af) | (J 2,---,n). Since | a | 
for at least one value of 1, we have 


Max (| | am i) < Max(1, | a!) |) =); 


in contradiction to (15), if the pair a, B were in G; consequently the conditions 
(14) are not all satisfied. On the other hand, by (10), (16), (17), 


lalSh, cht (l=2,---,n); 


hence 
| — BY | (h | = Dh, 


and (11) is proved. 

Let $\gamma$ run over all numbers of K. It follows from (9) that only a finite 
number of the domains $B_\gamma$ enter into the fundamental parallelepiped E; let 
$E_0$ be the set of all points of E not contained in any $B_\gamma$. We choose now a 
complete system $\Gamma$ of modulo $\mathfrak d^{-1}$ incongruent numbers $\gamma$ with $N(\mathfrak a_\gamma) \le t^n$. 
If $\xi$ is a point of $E - E_0$, then there exist a number $\beta$ in $\mathfrak d^{-1}$ and a number $\gamma$ 
in $\Gamma$ such that $\xi - \beta = \eta$ lies in $B_\gamma$; in view of Lemma 1, $\beta$ and $\gamma$ are uniquely 
determined. On the other hand, for any $\eta$ in R, there exists a number $\beta$ in 
$\mathfrak d^{-1}$ such that $\eta + \beta = \xi$ lies in E, and $\beta$ is uniquely determined except when 
$\xi$ lies on the frontier of E. Consequently, the formula 

(18) $\int_E g(\xi)\, dv = \sum_{\gamma \in \Gamma} \int_{B_\gamma} g(\xi)\, dv + \int_{E_0} g(\xi)\, dv$ 

holds for an integrable function $g(\xi)$, whenever $g(\xi + \beta) = g(\xi)$ for all $\beta$ 
in $\mathfrak d^{-1}$, and in particular for the function defined in (7). 
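3. Approximation to $f(\xi)$ on $B_\gamma$. Let $\xi$ be a point of $B_\gamma$, $\xi - \gamma = \zeta$ 
and $\mathfrak a = (1, \gamma\mathfrak d)^{-1}$; then 
$N(\mathrm{Max}(h\,|\zeta|,\ t^{-1})) \le N(\mathfrak a^{-1})$, $N(\mathfrak a) \le t^n$. 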
We determine a point 6 > 0 such that 
D2", -N(0) = (a). 


On account of Minkowski’s theorem, the ideal a contains a number @ with 
0<|a/=6. Then ga* =b is an integral ideal and N(6) S D4; hence b 




belongs to a finite set depending only upon K. Choose a basis B,,° - -,Bn of 
then a = has the basis a, = «2 (kK =1,:--,n) and 
== O(6) (kam 


If » runs over a complete system of residues ‘modulo a, we have, by (6) 


(19) f(é) = b = Ota) 


Put A= 91%, +--+ ++ Gn%n, with rational integral gi,---,9n, and let Fy 
be the parallelepiped of the points with 
Sgo.+1 For all A occurring in (19), 


— A= 0(0) = O(t) = 
(nt (A+ = +h + [A+ 
= £00 h0 O(M~/"), 


Since F has the volume D4N (a), we obtain 


(20) pS (Ate) 9) — DAN (a) f Ody + O( 
Ey 


The number of all A in a, satisfying 0<A+y< vv" for fixed yp, is 
less than 


(21) 1+ N(v")N(a*) = 


On the other hand, for fixed », the sum of the F/) .is contained in the rec- 
tangular parallelepiped c@ and contains the smaller 
parallelepiped cd << y+ p< — c6, with a suite bly chosen positive c= O(1). 
Since the difference of the volumes of these two parallelepipeds is 
M/7) 0-1/1) (¢) = we obtain, by (20) and (21), 


> DAN (a) f 180°) dv + 


Setting 
Gy), 
u(mod a) 
we get the required approximation 
yi/r 


(22) f(é)= + O(M/r-a/n), 


0 




4. Estimation of $f(\xi)$ on $E_0$. Let $\xi$ be a point of $E_0$. Applying Weyl’s 
method of estimating trigonometrical sums, we obtain 
where the summation is carried over the systems of integers A, Ai,° * * ,Ar-1 


in K defined by the 2”-'n conditions 


O<A+A 


Then 
and for each system A,,° * *,A,-, the point A runs over all integers in a rec- 
tangular parallelepiped P = P(Ay,° Ar+) Whose sides are < 
= O(M/""), 
Put 
(24) r! A, *Ar-1 FB; 
ACP 


For any fixed integer » in K, we obtain 


pS ue) — + OC MEM 
A-w C_P 


whence 


(25) w= O( Min | 18m — |-1,. | 


where w,,° * *,@, are the basis of all integers in K. 
Let 
S (oxpée) == (ly. dx, 4 di, 4 (k = By n), 


with rational integral a,. and define 


n n 


Dd ape = > = €; 
k=1 k=1 


then 
0, 6, dy = S (axl), 1S(oxug) (k = 1,- 


In view of Lemma 2, there exist two numbers a, 8 in K satisfying (10), (11), 
(12), (18) and 1], If exactly q of the conjugates ,- - -, are 
of absolute value < D+, then 0g =n—1, by (12), and we may assume 




(26) | >t, < DIA(1SkSq), |e 


Since £=O(1)Max(| d, |,- - -,!dn|), we conclude from (25) that 


(27) Ara) = O(MO/) Min (Mir, | 

The point £ depends only upon p, for any given € in Hy. On the other 
hand, for fixed y», the number of integral solutions A,,- - -,Ar-1 of (24), satis- 
fying | is O(M**/) in case 
O(M*) otherwise, A denoting an arbitrarily small positive number. Introduce 


the abbreviation 
(28) Min(M?/"r, | |-1) == 7 (p); 
then, by (23) and (27), 


where » runs over all integers in K satisfying 


(30) rly, 
Let 9:1,° be rational integers and let W = - -,gn) > 0 be 


the number of different integers » in K fulfilling (30) andthe n conditions 
(31) gx = Max (| D4) +1 
Let be one of these » and pé = Setting — B and a(d — 3) 
-— B(u— =k, we obtain 
(32) | < 
(33) «= 8(n— —a(t—Q). 
On account of (10) and (30), 
8(u— ‘m)(1-1/r)) O(M-@/")) o(1 ), 

whence, by (32) and (33), | «|< D-“/”; but « is a number of d-’, and con- 
sequently x = 0, 
(34) (u—p)/a (0—9)/B = (t{—2)/2. 

This proves that a is a divisor of (u—j) Bd. In view of (13), we infer 
that «|v(~— ji), where v denotes a positive rational integer depending only 


upon K. Since 


— pi) /2 = 0-1/r)) 


and, by (11), (26), (34), 


it follows that the number of values of the differences » — A is 


1+ O(h2) | ; 
k=q+1 
hence 


k=qt1 
In view of = O(1), we have = O(1)(kKSq) and = O(| «™.|) 

> q), by (26) and (31). For any fixed g, 4g, the number of possible 
n-1 

systems 9:,° * *,9n-1 in (31), with W > 0, is O( JJ | «™ |). Consequently, 
k=q+1 

the number of integral » in K, fulfilling (30) and the single condition 


(35) gq = 2 | qin) < g + i, 
is 
(36) Wo= W(g1,° Gn-1,9) 


=O(1+ TE TT | 


k=q+1 k=qt+1 
O(h n-q-1 + Mi-1/rta(1-1/n) | 
MO -1 n)(1-1/r+0) - MQ/n)0-1/r) | gin) 


Defining 


0<9 Jam | 
we obtain, by (28), (35) and (36), 


(37) => W,0(Min( | g |, | g +1 [7] |) 
g 


lul<r! 


= 90 (1 + M (1/n) G-1/r) | qin) 
By (10) and (26), 


O( M1/nr x(n) | log | gi) | + h log h) O( G-1/rta) log 
QM | gin) + log h)O(M/") = log M), 


and (29), (37) lead to the required estimate 


(38) f(€) = 




5. Proof of the asymptotic formula for A(v). The relationship 


(39) G(y) =N(at) = (N(a))-°/0(1) 


u(moda) 


may be proved in exactly the same way as in the case n = 1. Moreover 
yl/r 


17’Sdn = O(Min(v”, | ; 
0 
consequently (22) leads to the formula 
r 


f™(é) D-(m/2) Gm (y) N™ ( f O (Mm (/r-a/n) ) 


0 


(N (a) (Min r ))O( M r-a in). 
for all y-+¢ in B,, whence 


foe )dv J)-(m/2) ve (y)1- S(vy) f 17’Sdyn) 189 dv 


By By 


0 
( MmQ/r-a/n) ) + O (Mm/r-a n-1), 


On the other hand, for any point é of  — B,, the inequality 


h | > (N(a)) 


is true for at least one k; therefore 


yi/t 
f 17’Sdy) 1-809 dy N™ (Min | )dvO(1) 


R-By R-By 


f dz0 ( M G-1/n)) (N (a) ) ( m/r-2) (ra 


h-3(N(a) )-a/n) 


Since 
f N™ ( 121’ dn) 1 dy 
° 


with the constant o) defined in (4), and ma > (m—1)a= (m—1)/nr 
— 2r1=>n, by (3), we obtain 


(40) > ff 9(8)dv —D-™ 


(M™/r-*) -+ O (M'm/r- 1)(1+a/n-1/nr) ) 
=== (M™/r-*) +. O 1/n) (1/r-a) (m/r-n-1) ) o( ) 




with 
y (mod 97?) 
where y runs over a complete system of incongruent numbers in K modulo 0". 
The first assertion of the Theorem, namely formula (3), follows now from 
(8), (18), (38) and (40). 


6. The singular series. For every ideal a in K we define 


f(a) = > Gm 


where y runs over a complete system of modulo (ad)-? incongruent numbers 
satisfying (1, yd)-' then 


o= > H(a), 


a 


the summation extended over all integral ideals a. 
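Denote by $A(\nu, \mathfrak a)$ the number of modulo $\mathfrak a$ incongruent systems of 
integral solutions of the congruence 

$\lambda_1^r + \cdots + \lambda_m^r \equiv \nu \pmod{\mathfrak a}$, 

and let $A_0(\nu, \mathfrak a)$ be the number of modulo $\mathfrak a$ primitive solutions, i. e., satisfying 

$(\lambda_1, \dots, \lambda_m, \mathfrak a) = 1$. 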
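
In the rational case n = 1, with the ideal replaced by a positive integer modulus, both counts can be obtained by direct enumeration. This is a minimal sketch for small moduli only; the function name `count_congruence_solutions` is hypothetical.

```python
from functools import reduce
from itertools import product
from math import gcd


def count_congruence_solutions(nu: int, r: int, m: int, modulus: int) -> tuple[int, int]:
    """Return (all, primitive) counts of residue systems (l_1, ..., l_m) mod `modulus`
    with l_1^r + ... + l_m^r congruent to nu; primitive means gcd(l_1, ..., l_m, modulus) = 1."""
    total = 0
    primitive = 0
    for lams in product(range(modulus), repeat=m):
        if (sum(pow(lam, r, modulus) for lam in lams) - nu) % modulus == 0:
            total += 1
            if reduce(gcd, lams, modulus) == 1:
                primitive += 1
    return total, primitive


if __name__ == "__main__":
    # Sums of three squares modulo 8.
    for nu in range(8):
        print(nu, count_congruence_solutions(nu, 2, 3, 8))
```

The enumeration costs `modulus ** m` steps, so it is meant only as a check on small cases of the definitions.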
Exactly as in the known case n = 1, the following four statements are 


proved, for any m > 2r. The singular series o has the factorization 


r) Th = > H(p*), 
)). 
where p runs over all prime ideals in AK; the singular series vanishes, if and 
only if o,—0 for at least one p; let p’ and p* denote the highest powers of 


p dividing v and r, then 
y= A p27) (m 1)q) > b 2C), Th = N (p- (4 2c) 


the singular series possesses a positive lower bound for all v in J,, if 
Ay(v, p*¢*1) > 0 for all prime ideals p. 


In order to prove the second assertion of the Theorem, concerning the 
value of o, we consider now more closely the ring J,, generated by the r-th 
powers of all integers in K. On account of the identity 


=] 




r-1 


the number $r!\,\rho$ belongs to $J_r$, for any integer $\rho$ in K; since also 1 belongs 
to $J_r$, the ring $J_r$ is an order. Let $\mathfrak p$ be a prime ideal; we say that an integer $\nu$ 
of K belongs to $J_r(\mathfrak p)$, whenever $\nu$ is congruent, modulo $\mathfrak p^q$, to a number $\nu_q$ 
of $J_r$, for $q = 1, 2, \dots$; obviously $J_r(\mathfrak p)$ contains $J_r$ and constitutes also an 
order. If $(r!, \mathfrak p) = 1$, then $J_r(\mathfrak p) = J_1$. Moreover, it is easily seen that $\nu$ 
belongs to $J_r(\mathfrak p)$, if the congruence $\nu \equiv \nu_q \pmod{\mathfrak p^q}$ has a solution $\nu_q$ in $J_r$ 
for the fixed exponent q = 2c + 1, with the above definition of c; consequently 


v belongs to J,(p), if and only if the congruence 


has a solution in non-negative rational integers ~%,<h (k=—1,---,h), 
where h = N(p***?) and m,° °°, constitute a complete system of integral 


residues modulo $\mathfrak p^{2c+1}$ in K. On the other hand, using a basis of the order $J_r$, 
one proves immediately that $\nu$ belongs again to $J_r$, if it belongs to $J_r(\mathfrak p)$ for 
all $\mathfrak p$. These remarks provide a method for the explicit construction of $J_r$. 
If $\nu$ is not in $J_r$, then it is not in $J_r(\mathfrak p)$, for some $\mathfrak p$, hence a fortiori 
$A(\nu, \mathfrak p^q) = 0$ for q > 2c, and $\sigma_{\mathfrak p} = 0$, $\sigma = 0$. Consequently, for the 
completion of the proof of the Theorem, it is sufficient to demonstrate the 
following 

LEMMA 3. If $m > (2^{r-1} + n)nr$, then $A_0(\nu, \mathfrak p^{2c+1}) > 0$ for all $\nu$ in $J_r(\mathfrak p)$. 


Proof. Put N(p) = p’, p being a rational prime number, and let p! be 
the highest power of p dividing p; then gl = n and I|c = fl, where p! denotes 
the highest power of p dividing r. 


The numbers of the ring J, form modulo p***? an additive Abelian group; 


let s be its order, then s| = = Since J, is generated 
by the 7-th powers of all integers, there exist integers m,° -* -, ya in K and 
rational integers gq, >1 (k=1,:--,d), with qi- such that the 


linear form + + wana” (te = k= 1,-:--, d) 
uniquely represents all numbers of J, modulo p*¢*?, 

Let m, denote the smallest number such that every rational integer is 
congruent to a sum Ym," modulo p? (q—1,2,-- where 
Yi," Ym, are rational integers; define 


d 
> Min(qi —1, mp) = j. 
k=l 




The congruence 


has an integral solution A,,- - -,A,; in K, whenever belongs to J,(p). If 
(v,p) =1, then the solution is certainly primitive modulo p; if p|v, then 
(v—1,p) —1; consequently, in both cases, Ao(v, p*°*?) > 0 provided that 
m=j-+1. Therefore it is sufficient to prove the inequality 


(41) (27% 4+1)nr. 

Since is a power of p and qi: we infer that 
dSn(2f+1). On the other hand, it is known that m,< 4r for r > 3, 
My == 4 for r—2,3. Moreover, in case (p,r)=—1 and 2f+1 


<= 3(log r/log p) in case p|1; hence 


| < ars > 4), 
4) J 
( nr nr 427 (r>2, (p,r) =1 or p=r=—3), 

In the two remaining cases p = 2, r = 2/, f = 1 or f = 2, we set gy = 2% 
(k =1,:--+,d) and assume that the values (u—1,2,: --,2f) and 
dy. > 2f occur exactly hy and times. In both cases, = 2°/ r?, and 
therefore 


| of 2f+1 
+ >> (2* —1)hy= > uhy S n(2f + 1), 
u=1 
whence 


(43) j/nr S (r? —1)/r (14+-1/2f) < %rS2741 (plr—2,4). 


The assertion (41) follows from (42) and (43); and the proof of the 
Theorem is now accomplished. 
As an immediate consequence of the Theorem, there exists a positive 
rational integer w depending only upon K and m such that the equation 

(44) $(\xi_1/w)^r + \cdots + (\xi_m/w)^r = \nu$ 

has a solution in totally positive integers $\xi_1, \dots, \xi_m$ in K, for all totally 
positive integers $\nu$ in K, if $m > (2^{r-1} + n)nr$. This particular result can 
also be found in the following simpler way: 

According to (5), we may assume Cy (k,l=1,: -+,n), where 
the constant C depends only upon A and r. Since the numbers of K lie every- 


where dense in 2, we may construct n totally positive numbers #,,- - -, 0, in 
3 3 n 




K such that the matrix (7), with 4: = 97’, lies in any given neighborhood 
of the unit matrix (e7); hence we may assume | yi) — ex | < 1/Cn, where 
= Then v= +: - -+ and,", with 

S(yxv) = (1 — 1/Cn)v™ (1/Cn) 


> (1 —-1/Cn—- (n— 1) /n)v™ > 0 (k= 1,---,n). 


Choose a positive rational integer v such that the numbers vi, and v"y, 


(k=1,--+-,n) are all integral; then va, is a positive rational integer. 
Assume now that the Waring-Hilbert Theorem holds for the exponent m = mp, 
mo 


in the field of rational numbers; then v’a, = > 2x." with rational integral 


(kK=—1,:--,n; 1=1,--+,mo), and even zy: > 0, if v is chosen 
sufficiently large. It follows that (44) has a solution, if w =v? and m = mgn. 
Using the Theorem for n= 1. we infer that m= is a 
sufficient condition for the existence of a solution of (44). This condition is 
weaker than (2), in case n > 1, and it might be suggested that the Theorem 
remains true, if this condition is substituted for (2). The demonstration of 
the suggestion can be performed by using sharper estimates in some places of 
our proof. 

In the case r = 2, it is known that the Theorem holds even under the 
condition m > 4, independent of n, instead of (2). The question arises 
whether the lower bound $(2^{r-1} + n)nr + 1$ for m could be replaced by a 
function of r alone; however, the solution of this new problem seems rather 
difficult. 


INSTITUTE FOR ADVANCED STUDY, 
PRINCETON, N. J. 


AN UNSOLVED CASE OF THE WARING PROBLEM.* 


By IVAN NIVEN. 


The Dickson-Pillai-Vinogradow solution¹ of Waring’s problem leaves 
untreated the case in which 

(1) $r = 2^n - q - 2$, 

where 

(2) $3^n = 2^n q + r$, $0 < r < 2^n$. 

The Waring problem is the determination of the value of g(n) such that every 
positive integer is expressible as a sum of g(n) positive or zero n-th powers, 
whereas at least one positive integer is not a sum of g(n) - 1 n-th powers. 
The value of g(n) has been determined for n > 6, unless r has the value (1). 
We prove here that in case (1) holds, g(n) has the ideal value I, that is² 

(3) $g(n) = I = 2^n + q - 2$. 
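
The quantities in (1), (2) and (3) are easy to compute with exact integer arithmetic. The following sketch uses hypothetical function names; as a sanity check, the expression $2^n + q - 2$ also reproduces the classical values g(2) = 4, g(3) = 9, g(4) = 19, g(5) = 37 and g(6) = 73, although the text itself is concerned with n > 6.

```python
def waring_quotient_remainder(n: int) -> tuple[int, int]:
    """Return (q, r) with 3**n = 2**n * q + r and 0 <= r < 2**n, as in (2)."""
    return divmod(3 ** n, 2 ** n)


def ideal_value(n: int) -> int:
    """The ideal value I = 2**n + q - 2 of formula (3)."""
    q, _ = waring_quotient_remainder(n)
    return 2 ** n + q - 2


def is_case_one(n: int) -> bool:
    """True when r = 2**n - q - 2, i.e. when n falls under case (1)."""
    q, r = waring_quotient_remainder(n)
    return r == 2 ** n - q - 2


if __name__ == "__main__":
    print([ideal_value(n) for n in range(2, 11)])
    print([n for n in range(2, 500) if is_case_one(n)])
```

The second line of output lists the exponents below 500 that satisfy (1); the search range is an arbitrary choice made for the illustration.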


By the remark immediately following Theorem 4 of (D) we consider 
n > 180. We shall make use of Lemma 10 of (D), altered slightly to suit 
our purposes: 

If n > 180, $L \le 6^n$, and all integers in the interval $(L, L + 2^n)$ are sums 
of m n-th powers, then every integer $\ge L$ is a sum of m + q + s - 2 n-th 
powers. 

The integer s is defined by 

(4) $s = f + 2g$, $f = [(4/3)^n]$, $g = [(5/4)^n]$, 


where [x] denotes the greatest integer $\le x$. Lemma 10 of (D) has $n \ge 35$ 


* Received October 21, 1942. 

¹ L. E. Dickson, “Proof of the ideal Waring theorem for exponents 7-180” and 
“Solution of Waring’s problem,” American Journal of Mathematics, vol. 58 (1936), 
pp. 521-535. These papers are written with a continuity in the numbering of formulas, 
sections etc., and so we refer to them jointly as (D). 

² Thus the values of g(n) for n > 6 are complete, as follows (using f as defined 
in (4)): if $r < 2^n - q$, $g(n) = I$; if $r \ge 2^n - q$, $g(n) = I + f$ or $I + f - 1$ according 
as $2^n =$ or $< fq + f + q$. 




and the inferred hypothesis that $L \le 4^n$. The change in the inequality 
satisfied by L corresponds to replacing 3 by 6 in the inequality following equation 
(15) of (D); the resulting inequality is true for $n \ge 8$. 
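
The integers f, g and s of (4) can be computed exactly, and the size comparisons that the argument relies on further on ($s < 2f$, $2f^2 + 3f + 1 < 2^n - s$ and $4^n f < 6^n$) can be spot-checked for individual exponents. This is a sketch with hypothetical function names; a check of finitely many n is of course not a proof.

```python
def niven_f_g_s(n: int) -> tuple[int, int, int]:
    """Return (f, g, s) with f = [(4/3)^n], g = [(5/4)^n], s = f + 2g, as in (4)."""
    f = 4 ** n // 3 ** n
    g = 5 ** n // 4 ** n
    return f, g, f + 2 * g


def size_conditions_hold(n: int) -> bool:
    """Check s < 2f, 2f^2 + 3f + 1 < 2^n - s and 4^n f < 6^n for one exponent n."""
    f, _, s = niven_f_g_s(n)
    return (
        s < 2 * f
        and 2 * f * f + 3 * f + 1 < 2 ** n - s
        and 4 ** n * f < 6 ** n
    )


if __name__ == "__main__":
    print(niven_f_g_s(10))
    print(all(size_conditions_hold(n) for n in range(181, 400)))
```

Integer floor division is used instead of floating point so that the greatest-integer brackets are evaluated without rounding error.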


We write, as in (D), 

(5) $4^n = 3^n f + 2^n h + j$, $2^n h + j < 3^n$, $0 \le j < 2^n$. 

Using (1) and (2) we have 

(6) $4^n = 2^n(qf + f + h) + j - qf - 2f$, 

so that 

$j \equiv qf + 2f \pmod{2^n}$. 

But also $qf = [(3/2)^n] \cdot [(4/3)^n] < 2^n$ and $2f < 2^n$ so that we have 

(7) $j = qf + 2f$ with $h = 2^n - qf - f$, 

or 

(8) $j = qf + 2f - 2^n$ with $h = 2^n - qf - f + 1$, 

whichever value of j satisfies the last inequality in (5), the corresponding 
value of h arising from (6). 
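
The decomposition (5) amounts to two successive divisions with remainder, which the sketch below carries out (the function name is hypothetical). Without assuming (1), the same substitution of (2) into (5) gives $4^n = 2^n(qf + h) + rf + j$, hence $j \equiv -rf \pmod{2^n}$; inserting $r = 2^n - q - 2$ turns this into the congruence $j \equiv qf + 2f$ stated above.

```python
def decompose_power_of_four(n: int) -> tuple[int, int, int]:
    """Return (f, h, j) with 4**n = 3**n * f + 2**n * h + j,
    2**n * h + j < 3**n and 0 <= j < 2**n, as in (5)."""
    f, rest = divmod(4 ** n, 3 ** n)
    h, j = divmod(rest, 2 ** n)
    return f, h, j


if __name__ == "__main__":
    for n in (7, 8, 9, 10):
        f, h, j = decompose_power_of_four(n)
        q, r = divmod(3 ** n, 2 ** n)
        # General form of (6): no assumption on r is needed here.
        print(n, (f, h, j), 4 ** n == 2 ** n * (q * f + h) + r * f + j)
```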

We now find an interval $(L, L + 2^n)$, every integer of which is a sum 
of $2^n - s$ n-th powers; this will enable us to apply the lemma from (D). We 
use different values of L in the two cases, (7) and (8). 

If (7) holds we take $L = 4^n f$. All integers in the interval 
$(4^n f, 4^n f + 2^n - s - f)$ are clearly sums of $2^n - s$ powers. We rewrite the next integer 
in the form 

(9) $(f^2 + t)3^n + (fh - tq - t + f + 1)2^n + fj + tq + 2t - 2^n f - s - f + 1$, 

by means of (5), (1) and (2), and the arbitrary integer t is chosen to satisfy 

(10) $t = [2^n f/(q + 1)] - f^2$. 

Note first that t is not negative. For by (7) $2^n \ge qf + f$ so that 
$2^n f \ge f^2(q + 1)$. We shall also need the inequalities 

(11) $t > 2^n f/(q + 1) - f^2 - 1$, 

and 

(12) $t \le 2^n f/(q + 1) - f^2 = f(2^n - qf - f)/(q + 1) < f$, 




the last step being a consequence of 


Now the integer (9) is clearly a sum of $f(f + h + j - 2^n) + 2t - s + 2$ 
n-th powers, and this number, by (7) and (12), is less than $2f^2 + 2f - s + 2$. 
Hence all integers from (9) to $4^n f + 2^n$ are sums of $2f^2 + 3f + 1$ n-th powers. 
For n > 180 this is less than $2^n - s$, since $f^2 < (16/9)^n$ and $s < 2f$. 

To complete this argument we must show that the coefficients of $3^n$, $2^n$ 
and 1 in (9) are not negative. The coefficient of $3^n$ is positive since t is not 
negative. The coefficient of $2^n$ exceeds $fh - t(q + 1)$, which is not negative 
since the first inequality in (12) may be written $t \le fh/(q + 1)$. Finally 
we have 

$fj + tq + 2t - 2^n f - s - f + 1$ 
$> qf^2 + 2f^2 + 2^n f - qf^2 - f^2 - q - 1 - 2^n f - s - f + 1$ 
$= f^2 - q - s - f$, 

by applying (7) and (11) to j and t respectively. An easy computation shows 
this to be positive for n > 180. 

If (8) holds we take L = 4^n u, where u is an integer to be specified. As
before we take the interval (4^n u, 4^n u + 2^n − s − u) for granted, and begin
with N = 4^n u + 2^n − s − u + 1. We have, by (5),

(13) N = 3^n(uf) + 2^n(uh + 1) + uj − s − u + 1.

First we suppose that j > f/2. Taking u = 4 we see that N in (13) is
a sum of 8f − s + 2 n-th powers, by (8). Hence all integers in the interval
(N, N + s + 3) are sums of 8f + 5 powers, which is less than 2^n − s for
n > 180. Again we must demonstrate that the coefficients of 3^n, 2^n and 1
in (13) are not negative. Our hypothesis on j enables us to write the only
inequality needed,

     uj − s − u + 1 = 4j − s − 3 > 2f − s − 3 > 0.

On the other hand suppose that j ≤ f/2. Then choose u to satisfy

     u = [q/h] + 1

so that u is positive, and we have

(14) uh > q,   uh ≤ q + h.

Since j + h = f + 1, our hypothesis on j implies h > f/2 so that the second
inequality in (14) implies




140 IVAN NIVEN. 


(15) u < 2q/f + 1 < f.

We can write (13) in the form

     N = 3^n(uf + 1) + 2^n(uh − q) + uj − s − u + q + 3,

so that N is seen to be a sum of u(f + h + j − 1) − s + 4 n-th powers. By
(8) this equals u(2f) − s + 4, which is less than 2f^2 − s + 4, by (15).
Hence any integer in the interval (N, N + u + s − 1) is a sum of 2f^2 + u + 3
n-th powers, and (15) implies that this does not exceed 2f^2 + f + 3, which
in turn is less than 2^n − s. The coefficient of 2^n in N is positive by the first
inequality in (14), and the coefficient of 1 is


uj 


by (15) and the definitions of g, s and f. 

Thus we have exhibited an interval (L, L + 2^n), every integer of which
is a sum of 2^n − s n-th powers. In case (7) we had L = 4^n f, in case (8)
L = 4^n u with u = 4 and u < f in the two parts of the proof. The lemma
quoted from (D) is applicable since 4^n f < (16/3)^n < 6^n. To complete our
proof of (3) we have simply to show that every integer N < 4^n f is a sum of
I = 2^n + q − 2 n-th powers. This we proceed to do.

We separate the work into cases, since no unified treatment seems available.
The integers < 3^n are easily handled, so we take N between 3^n and 4^n f. All
integers are given by

(16) N = 4^n w + 3^n x + 2^n y + z,
052 


IIA 
1 


Case l. < 4"f, 2S 2"—2f—1. Then.N is a sum of r+ y 


+ 2-++ w n-th powers, and this is at most 
+ —1 4+ f—1 7. 


Case 2. N < 4^n, z ≥ 2^n − 2f. The first hypothesis implies w = 0,
and we can write

     N = (x − t)3^n + (y + qt + t)2^n + z − qt − 2t,

with the integer t to be specified. We need non-negative coefficients here, so
we must have t ≤ x and t ≤ z/(q + 2). These are satisfied if we choose

     t = min{x, [z/(q + 2)]}.
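The rewriting used in Case 2 trades t copies of 3^n for copies of 2^n and 1. The sketch below checks the identity and the count of powers on stand-in integers: B plays the role of 2^n and A that of 3^n, and we assume the relation A = B(q + 1) − q − 2, which is what the identity requires and which we take to be the content of (1) and (2). The function name and the sample values are hypothetical.

```python
def rewrite_case2(x, y, z, q):
    """Return the new coefficients of (A, B, 1) and the chosen t = min{x, [z/(q+2)]}."""
    t = min(x, z // (q + 2))
    return (x - t, y + q * t + t, z - q * t - 2 * t), t

# Stand-in values; A is forced by the assumed relation A = B(q + 1) - q - 2.
B, q = 64, 11
A = B * (q + 1) - q - 2
x, y, z = 3, 5, 40

(cx, cy, cz), t = rewrite_case2(x, y, z, q)
print("coefficients:", cx, cy, cz, "t =", t)
print("value preserved:", cx * A + cy * B + cz == x * A + y * B + z)
print("number of powers:", cx + cy + cz, "=", x + y + z - 2 * t)
```

Each unit of t lowers the number of summands by 2, which is why t is taken as large as the non-negativity of the coefficients allows.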


Now N is a sum of x + y + z − 2t n-th powers. If t = x we have

since x > 0. If t = [z/(q + 2)] we have


since 


22 S 2" 4 4f > of + 2"—4f > (¢ +2) (f+ 3). 


Case 3. 4^n ≤ N < 4^n f, z ≥ 2^n − 2f, w ≤ x − 3. Equations (1) and
(2) enable us to write

     N = w4^n + 3^n + 2^n{y + (x − 1)(q + 1)} + z − (x − 1)(q + 2).

This is a sum of w + y + z − x + 2 n-th powers, and our hypothesis on w
implies that this does not exceed I. The coefficient of 1 in N is positive since


= (2"— qf) + (q—4f) +2>0. 


Case 4. < 4"f, (7), 2"— w23. We can 


write 


N= 3" (2x wf + 1) 2"(y wh — tq 1) 
2+ wj+tq + 2t—2"(w +1) 


and we define ¢ by the equation 


t=|[(y+wh)/q] 
so that 
(17) tgq+q>ytwh2 tq. 


The second part of (7) implies that h Sq whence we have 

(18) 

We again use (7) to conclude that N is a sum of 


n-th powers, and the inequalities among the hypotheses of the case and (18) 
show that this does not exceed J. The coefficient of 2” is not negative because 




of (17) and (18). Finally we have 


2+ wi + tg + 2¢— +1) = 2 + — 2") + tg —2" 
>— f+ w(j—2") + wh—gq wf —q— 


Thus we see that our method is not adequate in case y + wf < q+ 2f but in 


this case we have 


2f—uf 
sw+2+ 0+ 9+ 2/—of 


w+ 2f—wf+3<iI, 
where the last inequality follows from w = 3. 
Case 5. < 4"f, (7), z= 2"— 2f, w=x— 2, or 2. The 


last two hypotheses imply that r= 4. If 2"—8 we have 
*—84+2—1. 


Consequently we can take z > 2”-—8 for the remainder of this case. 
First we assume that y+h <q. By (7) it is seen that h = 2"—j+f 
whence h > f so that y< q—f. Hence we have 


If on the other hand y + h= q, we write 


N =4"(w—1) + + f +1) 
+2"(y+h—q+1) 
which is easily shown to be a sum of J n-th powers. The coefficient of 2” is 
clearly positive, and we also have 
2+j—2"+ 94+ 2—2" 
— 1+ of + + 
= (af +f+q+1—2") + (f—8) >0. 
Case 6. 4°= N < 4"f, (8), 22 2"— 2f, w=ax—2. We divide this 


case into three parts. 
If y+ wh = q we can write 


N = wf +1) +2"(y + wh —q) + 2+ 


If z-+ w) = 2” we can write 
N = + wf) + wh +1) +2+ — 2". 


The necessary inequalities are obtained as before. 
sa If y+ wh =q—1 and z+ wj S 2"—1, these inequalities add to give 
yt+2+uw(f+1)S/, and we have 


SI since w>0. 


Thus our six cases are completely treated, and the various hypotheses 
cover all integers in (16). 


PURDUE UNIVERSITY. 




THE FAN INTEGRALS INTERPRETED AS MEASURES IN A 
PRODUCT-SPACE.* 


By A. J. WARD. 


In a recent paper¹ S. C. Fan has defined four related integrals of a function
f(x), in general non-measurable, on a set E. If the function and set
concerned are measurable, these all reduce to the Lebesgue integral, or rather
to the analogue of the Lebesgue integral for a general measure-function. Now
it is well known that the Lebesgue integral of a non-negative function is the
measure (in a space of one higher dimension) of the ordinate-set of the function.²
In this paper we prove an analogous result for the Fan integrals, showing
that they may be regarded as upper or lower measures of the ordinate-set of
f(x), and examine some of the theorems of SF in the light of this conception.
We then consider more especially the case when the basic measure-function is
regular,³ showing that in this case the Fan integrals may be expressed as
Lebesgue integrals of certain measurable functions associated with f(x).


1. As in SF, we consider an upper-measure function m(E), defined,
finite, and non-negative for all subsets of a given fixed set E_0, and subject to
the following conditions:⁴

(a) If E_1 ⊂ E_2, then m(E_1) ≤ m(E_2).


* Received May 20, 1942. 

¹ S. C. Fan, "Integration with respect to an upper-measure function," American
Journal of Mathematics, vol. 63 (1941), pp. 319-337. This paper will be referred to
as SF. Theorem 1 SF denotes Theorem 1 of Fan's paper, and so on.

² By the ordinate-set of f(x) on E we mean the set of points (x, y) such that
x ∈ E, 0 ≤ y ≤ f(x).

³ An upper measure m is regular if to each set E (of finite upper measure) there
corresponds a set H, measurable (m) and including E, such that mH = mE, and
therefore also m(XH) = m(XE) for every set X measurable (m) (such a set will be
called an equimeasurable cover of E). The most important property of regular upper
measures is the following. If (E_n) is a sequence of sets such that E_n ⊂ E_{n+1} for
each n, then m(Σ_{n=1}^∞ E_n) = lim_{n→∞} mE_n. See C. Carathéodory, Vorlesungen ueber reelle
Funktionen (2nd edition, Leipzig 1927), pp. 258 ff., especially Satz 15, 270.

⁴ The set E_0, and its subsets, may be composed of elements of any nature; a typical
element of E_0 will always be denoted by x. As we do not consider any questions of
topology in E_0, we do not require Carathéodory's fourth axiom (loc. cit. 239).




(b) For any two sets E, E',
     m(E + E') + m(EE') ≤ mE + mE'.
(c) For any sequence of sets (E_n),
     m(Σ_{n=1}^∞ E_n) ≤ Σ_{n=1}^∞ m(E_n).
We assume also that for the empty set O, m(O) = 0.
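On a finite ground set the conditions can be tested by brute force, which gives a quick way to try out candidate set functions. The sketch below is illustrative only: the names `is_upper_measure`, `capped` and `squared` are ours, and condition (c) is not tested separately because on a finite set it follows from (b), m(O) = 0 and non-negativity.

```python
from itertools import combinations

def subsets(ground):
    """All subsets of a finite ground set, as frozensets."""
    items = list(ground)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def is_upper_measure(m, ground):
    """Check m(empty) = 0, condition (a), and condition (b) on every pair of subsets."""
    all_sets = subsets(ground)
    if m(frozenset()) != 0:
        return False
    for e in all_sets:
        for f in all_sets:
            if e <= f and m(e) > m(f):                    # (a) fails
                return False
            if m(e | f) + m(e & f) > m(e) + m(f):         # (b) fails
                return False
    return True

def capped(s):
    """Number of points, capped at 2: satisfies (a) and (b) but is not additive."""
    return min(len(s), 2)

def squared(s):
    """Square of the number of points: monotone, but violates (b)."""
    return len(s) ** 2

print(is_upper_measure(capped, range(4)), is_upper_measure(squared, range(4)))
```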

Conditions (a) and (b) suffice to allow us to define a corresponding lower
measure m̲(E), as m(E_0) − m(E_0 − E), with the usual properties. If
m̲(E) = m(E), E is called measurable (m). Measurable sets have the usual
properties, and (c) ensures that the measure-function is completely additive
over measurable sets. Again, given any subset E_1 of E_0, we may define (for
subsets of E_1) lower measure relative to E_1, as m(E_1) − m(E_1 − E), and
speak of measurability relative to E_1.

For simplicity we consider only bounded functions f(x). By the addition
of a constant we may then suppose that 0 ≤ f(x) < M, where M is some
constant. In this case the most convenient definitions of the Fan integrals are⁵


f y) lay, 


M 
m[E(f <y)]dy, 
JE 


0 


and 


J f du* and J f dp» being defined similarly in terms of the lower measure m. 


We remark that if SC # and mS = mE, then 


E 
We may a all these integrals in terms of 0 integrals. Let us write 


cv) =f(z) on F and =0 on f.(x) =f(z) on EF and 
(x) = M on — FE. g(x) = M— f(z), and = M—f, (xz). Then ® 


M M M 
mL E(f <y)|dy= > M—y) =f m[E(g > y) |dy 
170 


0 


(changing y to M —~y), so that 


(1) f M(B) — f 
JE E 


⁵ Cf. equations (4) (321) and (16) (328), SF. We remark that on the right hand
side we have ordinary Riemann integrals, the integrand being in each case a monotone
function of y.

⁶ Cf. Theorem 7 SF.




Similarly, since 


mLE(f > y)) = > y)] = mEo— m[ (91 = M— y)] 
= mE, — m[E.(g1 > M — y)] 


for almost all positive y,” we deduce that 
(2) f dp* = M-m(E,) — gidu*. 
E Eo 
Finally, 
ml E(f <y)] = m[Bo(fs < y)] (if y= M) mE, — m[Bo(f2 = y)] 
so that 
(3) f fide du* —M[m(E,) — m(B)]. 


The last expression may be simplified if there exists an m-measurable set 
K CE such that mK = mE. For then (cf. Theorem 2 SF) 


Eo K Eo-K 
f f(x) dp* + M(mE, — mE) 
K 


since m|[ (E,—K)(f2 > y)]=m(E,—E) for all y <M. Hence in this case 


(4) f f du —{ f dp*. 
E K 


Similarly 


(5) f du. 
BE 


2. We now define an upper measure, which we shall denote by m × L_1,
or by 𝐦, in the Cartesian product-space E_0 × <0, M>. Let 𝐄 be any set in
this space, and let it be covered by a sequence of sets 𝐄_n, each of the form
E_n × J_n, where E_n is a subset of E_0 and J_n is a measurable set (in the ordinary
Lebesgue sense) in the interval <0, M>. Then 𝐦𝐄 is defined as the lower
bound of Σ_{n=1}^∞ (mE_n)·|J_n| for all such coverings.⁹ 𝐦𝐄 is clearly defined,
finite, and non-negative for all subsets of E_0 × <0, M>.

⁷ SF, 321, Remark (3).
⁸ Two important particular cases are (I) if the measure is regular; (II) for any
measure, if E is measurable (m).
THEOREM 1. mE satisfies the conditions (a), (b) and (c) of 1. 
(a) and (c) are clearly satisfied ; it remains to prove (b). Let E and E’ 


be any two sets of the product-space, covered by sequences } (Hn X Jn) and 


n=1 

X J’n) respectively, such that -| Jn | mE and 
n=1 n=1 

> | J’, |— m(E’) are each less than a given positive number 
n=1 

Take any integer N such that (mEn) - | Jn | (mE'n) and 

n> 


consider the 2*. non-overlapping “ elementary puheihs ” obtained by forming 
the product of any selection from the sets J,,J2.,---,Jw, J’1,°°+,d’y and 
the complements of the remaining sets of index = N. Arrange these ele- 


0 


mentary subsets *° in any order as K,, K2,: - -, Kp, say, where P = 27%. Any 
set Jn, where n = N, is the sum of a certain selection of the sets Kp. 

Since the sets K, are measurable and non-overlapping, we may, for all 
n= N, replace Fy, K In by (En X InK2) ++ + (En X JnKp) 
without altering the sum 3(mF,)-!Jn|. For any given pS P, let r,s,---,¢ 
be those indices such that J,,Js,- - -,J+¢ contain Ky. The sets (Hn XK JnKp), 
n==1,2,---,N, can be replaced by (H-+ +---+-+ X Kp, since 
JnKp = K, for and J,K,—0 for the remaining values of n; 
and by (b), 1, we have not increased the sum 3(mE,)-|Jn|, since 
m(E,+---+ mE; We may work similarly with the 
covering J’,). Thus we may replace the original coverings X Jn) 
and by new coverings 


(Fi Ki) + (Po X Ke) +: (Fe X Ke) + (En Jn) 
n>wN 
and 


(F’, X Ki) + X +: + X Kr) + X 
say, such that 
P N 
> Ky | | In| 


p=1 n=1 


⁹ |J| denotes the upper Lebesgue measure of any linear set J. We use the notation
m × L_1 for our upper measure when we are comparing it with some other measure in
the product-space: otherwise we use 𝐦, for brevity.
¹⁰ Some of the subsets may of course be empty, but this does not affect the argument.




and 
P N 
(mF",):|Kp|S (mE'n) | |. 


It is clear that E + E’ is covered by 


[(Fp + F's) x Ky] +> n xX Jn) + > n x 


n>N m>N 


and (since = 0 for EE’ is covered by 


X Ky) +3 (Bu X Jn) + 


p=1 n>wN >N 


Hence 
m(E + E’) + m(EE’) => [m(F,+ F’,) + m(F,F’y)] - | Kp | + 2 
p=1 


) + m(F’y)]- | Kp | + 2 


A> 


N 
<3 +S | | + 2 


n=1 


= m(E) + m(E’) + 4c, 


from which the required result follows at once. We may now define 


m X_L,(E), or m(E), as 
m|[E, X <0, M>] —m|[ X <0, — E]. 

In future we write J, for the interval <0, M>. 

THEOREM 2. Let 𝐄 be the ordinate-set of f(x) on E. Then

(ii) f(x)dp* — mx L,(E); 

(iii) 4. f(x)dp is the lower measure of E relative to E XK Jo; 

(iv) i f(x) dp is the lower measure of E relative to E + (E)— E) X Jo. 


Let =(2n X Jn) be any covering of E. Write pna(y) = m(E,) if y is in 
Jn, and pin(y) = 0 otherwise. For any yo, those sets for which yo J, must 
cover the section of E by the line y == yo, that is, the set H(f = yo). It follows, 
since m satisfies the condition (c), that 


pn(Yo) = (f = yo) = ml > yo) J. 




M 
Now, for each n, f bn(y)dy = (mE,)-| In|. Hence, if In| is 


finite, > rn(y) must be Lebesgue summable™ in J, and we have 
M M 
(mEn)-| In| pn(y) dy ml E(f > y) 
0 n 0 


It follows that f f du* = mE. 
JE 


On the other hand, given «> 0, we can form a division of Jo by points 

N-1 

B(F > (Yan <f +e 
L is clearly covered by the sets K(f Yn) 0, N— i; 
together with & (y= 0), and so 


mE <f f +. 
E 


Since ¢ is arbitrary, we have proved part (i) of the theorem. 
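For a finite set E the two quantities compared in this proof can be computed exactly. The sketch below is a toy illustration and not part of the argument: `layer_integral` evaluates the integral over <0, M> of m[E(f > y)], and `upper_sum` evaluates the sum attached to the covering by the sets E(f > y_n) × <y_n, y_{n+1}> for an equal division. The function names, the equal spacing and the sample data are our assumptions.

```python
def layer_integral(f, points, m, M):
    """Integral over y in [0, M] of m({x : f(x) > y}), exact for finitely many points."""
    levels = sorted({0.0, float(M)} | {float(f(x)) for x in points})
    total = 0.0
    for lo, hi in zip(levels, levels[1:]):
        # The set {f > y} is the same for every y in [lo, hi).
        level_set = frozenset(x for x in points if f(x) > lo)
        total += m(level_set) * (hi - lo)
    return total

def upper_sum(f, points, m, M, n):
    """Sum of m({f > y_k}) * (y_{k+1} - y_k) over the division y_k = k * M / n."""
    step = M / n
    return sum(m(frozenset(x for x in points if f(x) > k * step)) * step
               for k in range(n))

values = {0: 0.5, 1: 1.5, 2: 1.5}        # f on a three-point set, with M = 2
f = values.get
print(layer_integral(f, values, len, 2))  # counting measure: the sum of the values of f
print(upper_sum(f, values, len, 2, 3))    # a coarse division overestimates
```

Because m[E(f > y)] is a decreasing function of y, every such covering sum is at least the integral, and refining the division brings the two together.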
We remark that it follows, in particular, that 𝐦(E × J_0) = M·mE.
Hence, if E is measurable m relative to E_1, then E × J_0 is measurable 𝐦
relative to E_1 × J_0.
We now turn to part (ii) ; it is required to prove that 


X Jo —E)=M- mE, f dp* =f dp*, 
* Eo 
by (2) 1. 
Now E_0 × J_0 − 𝐄 is the set of all points (x, y) such that

     0 ≤ y ≤ M      if x is in E_0 − E,
     f(x) < y ≤ M   if x is in E.

It is clear that we may add the points [x, f(x)], for x in E (thus changing
the last inequality to f(x) ≤ y ≤ M), without altering the 𝐦 upper measure.
We then obtain a set which is congruent with the ordinate-set of g_1 on E_0,
being in fact obtained by a reflection of this set in the line y = ½M, and which
is easily seen to have the same upper measure. Hence we obtain the required
equation


Eo 


by part (i). 


¹¹ In particular, Σ_n μ_n(y) must converge for almost all y.


Parts (iii) and (iv) of the theorem follow similarly from equations (1) 
and (3) of 1. 


3. Theorem 2, part (i), can be viewed in an interesting way as a relation
between two upper measures. It is natural to define, in the (x, y) space,
m × J_1(𝐄), the product of m and Jordan measure, as the lower bound
of Σ_{n=1}^N m(E_n)·|J_n| for all finite coverings of 𝐄 by sets E_n × J_n, where J_n
is Jordan measurable (in particular J_n may be taken as an interval). A proof
similar to that of Theorem 1 shows that m × J_1 upper measure satisfies
conditions (a) and (b), though not (c); so that we may speak of lower measure
m × J_1. It is clear that
mXJ,(E)Sm X L,(E) Sm X L,(E) X J, (E) 

and that the two upper measures are not the same in general. However, the 
proof of Theorem 2 (i) shows that the m X J, and m X L, upper measures 
of an ordinate-set are equal. 

We now consider the relation of our measure to the Ulam-Hahn product 
measure ?* which we shall denote for the moment by Um. Let X% be the class 
of sets measurable m and ,; the class of sets measurable in the ordinary 
Lebesgue sense in Jo. As remarked above, any set of the form 2 X Jo, where 
E is in X, is measurable m X L, and m X L,(E X Jo) = mE-|J,|. The 
same is obviously true for any set of the form HX J, where J is an interval. 
It follows at once that it remains true for sets of the form VY & J where £ is 


inX and J isin £;. The sets measurable m X L, form an additive class and 
the measure m X L is completely additive in that class. Hence any set E 
measurable (XP) is also measurable m X L,, and m X L,(E) = Um(E),. 
Finally, any subset of a set of zero Um-measure is also, by the above argument, 


of zero m X L, measure, and so we have further: 


Any set E measurable (Xf) is also measurable m XxX L, and 
m L,(E) = Um(E). 


The converse is not in general true, as may be seen by the following 
example. 

Let Ey be the set of real numbers 0 = z= 1 and Jo the set OS y=1; | 
write for HC Ey, m(#) =| 
satisfies the conditions (a), (b) and (c), but that the only sets measurable m 


%, It is easily verified that this upper measure 


128. Saks, Theory of the Integral (Warsaw, 1937), pp. 82-88. We adopt the 
notation there used. 




are, firstly, sets V such that mN =! N | = 0, and secondly their complements 
E,—N. Then all sets in the product-space which are measurable (XP) are 
of the form (ZH, —N) X J +N, where mN =0, J is measurable £1, and N 
is a subset of N * J,. For the sets of this form constitute an additive class 
which includes all sets of the form EF X J where £ is in X and J is in Pi. It is 
also clear that the Um-measure of such a set is equal to |J|. The sets 
measurable (XP) are obtained by forming the sum of any set measurable 
(X#:) and any subset of a set measurable (XP,) with zero Um-measure. 
Thus all sets measurable (XP:) are of the form 
(Eon —N) XJI+N,+N,, 

where V,J are as before, N,; is a subset of N & Jo, and N, is a subset of Ey & Jo, 
say, where | J,|—0. Now consider the diagonal set OS¢ea—y<1. It 
cannot be expressed in the above form and is therefore not measurable (XP:). 
It is however easily verified (covering by the sets n/kSa,yS (n +1) /k, 
n=0,1,:--.k 1) that this set has m * L, measure zero, and is therefore 


measurable m 
4. We now return to the Fan integrals. Theorem 2 shows that the
integrability (μ*) of f over E is equivalent to the measurability (𝐦)
of the ordinate-set of f on E, relative to E × J_0.¹³ (There does not, however,
appear to be any such simple interpretation of integrability (μ_*), if E is not
measurable.)

In the light of this remark, the additivity of the integral over relatively
measurable subsets of E¹⁴ is seen to correspond to the fact that, if E_n is
measurable m relative to E, then E_n × J_0 is measurable 𝐦 relative to E × J_0.
The ordinate-sets of f on the various sets E_n are 'separated' by being
contained in the relatively measurable sets E_n × J_0.

We next consider Theorem 11 SF, which states, in our language, that a
necessary and sufficient condition that the ordinate-set of f on E should be
measurable 𝐦 relative to E × J_0 is that the function f should be measurable
m relative to E. The necessary condition is now seen to be a special case of
the following analogue of Fubini's theorem.

THEOREM 3. Let 𝐄 be any set measurable 𝐦. For any y, let E(y) be
the set of x such that (x, y) is in 𝐄. Then E(y) is measurable m for almost
all y, and

     ∫_{J_0} mE(y) dy = 𝐦𝐄.

¹³ In fact, it is equivalent to measurability m × J_1, which is stronger.
¹⁴ Theorem 2 SF and the corresponding theorem for μ* integrals.


The proof is similar to that of Theorem 2. Since mE + m(E, X Jo. —E) 
= M-mE,, we can, given > 0, find sequences of sets K Jn) and 


n 
> (4'n X J’n), covering E and Ey X J, — E, respectively, such that 
n 


(6) m(Ln) !In| <mE+ 
and 
(7) m | | <M “mM E, mE €. 


n 
Define as in Theorem 2, and similarly p’»(y) for the set E’n J’n. 


As in Theorem 2, we have, for any y, 


(8) Spun(y) = mL (y) 
and 
(9) Sun’ (y) = m[ — E(y)] = mE, — mE (y). 


Now  Spn(y) and Sy’n(y) are, as before, finite almost everywhere in J, and 
Lebesgue summable. Denoting by fdA* the Fan integral with respect to 


Lebesgue measure, we now have, from (8) and (9), 


(En) In | Spun(y)dy = ( Spin (y) da* =f mE (y)dr* 
Jo 7 Jo Jo 


and 


mE (y) dx =f — Sp'n(y) |dA=M- mE, —f Spn’ (y) dy 
0 Jo e Jo 


= M-mE, — | |. 


From (6) and (7) we now obtain 


m(E) —e« <f mE(y)dr mE (y)dx mE (y)dvA* < m(E) +. 


Since ε is arbitrarily small, mE(y) must be integrable (λ). It is therefore
measurable (L_1) as a function of y,¹⁵ and the Fan integrals reduce to
ordinary Lebesgue integrals. Similarly m̲E(y) is measurable (L_1). We
therefore have


mE (y) dy -{ fin E (y) dy = 
Jo Jo 


from which we see that mH (y) = mH(y) for almost all y.*® 


5. The conditions (b) and (c), satisfied by 𝐦 upper measure, show at
once that if ΣE_n = E, and f(x) ≥ 0 for all x, then, whether the family of
sets (E_n) is finite or enumerable,

     Σ_n ∫_{E_n} f dμ* ≥ ∫_E f dμ*.

¹⁵ Theorem 11 SF.
¹⁶ The above proof is essentially the same as a standard proof of the ordinary
Fubini theorem (C. Carathéodory, loc. cit., 621 ff.). We have, however, given the proof
in full as it affords an interesting application of the Fan integrals.


The corresponding theorem for fda (Theorem 3 SF) is less immediate. 
It can, however, be obtained rather more shortly than in SF by applying the 
following lemma (SF, 323) to m upper measure. 


LemMMA. Jf E,,E.,- + are mutually disjoint sets, then 


P 2P P P 
1 + Esn) —m(>E;) = > m(E.,) — m > Eon. 
n=1 i=1 n=1 n=1 

We need consider only the case when the sets #, are non-overlapping. Let 
E.n-, denote the ordinate-set of f on and let Eon = En Jo — Eon. The 


lemma then gives at once, for a finite sum of sets Py, 


P P 
> (M-mE,— f dz] 
n=1 En n=1 LEn 
= M m E,, M m ( p Ey) 
n=1 n=1 


that is, 


f 


The passage to the limit to obtain the corresponding result when we have 
an infinite sequence of sets #, is, in general, valid only when the upper 


measures m and m are regular—a case which we consider later.‘7 The theorem 


is however always true if }} m#, is finite: for if we choose p so large that 


n=1 
OO 
> mE, <« (say), then m( > En) < € and so 
n=p+1 n=p+1 
: ed P 
J x n=1e/ Ey 
n=1 n=1 n=p+1 


from which the result follows. 

¹⁷ The following example shows that the theorem may fail for an infinite sequence
of sets. Let E_0 consist of two infinite sequences of points (A_n) and (B_n). Let
m(E) = 2 if E contains an infinite number of points A_n, and = 1 for all other
non-empty sets, the empty set of course having zero measure. Let f(x) = 1 at each
point A_n, and f(x) = 0 at each point B_n, and let E_n consist of the two points A_n and
B_n. It is easily verified that m satisfies conditions (a), (b) and (c), and that
∫_{E_n} f dμ_* = 0 for each n, while ∫ f dμ_* = 1 over the sum Σ_{n=1}^∞ E_n.


6. We now consider Theorem 6 SF, or rather the equivalent theorem 


This has an obvious similarity to the condition (b) satisfied by an
upper-measure function, but it cannot be deduced simply by applying (b) to either
m × L_1 or m × J_1 upper measure. It can, however, be expressed as a simple
property of an upper measure m × J_1* which we now define.

Let 𝐄 be any set in the (x, y) space, and, for given x, let 𝐄(x) denote
the set of y such that (x, y) ∈ 𝐄. Let (E_n) be any finite sequence of sets in
the x-space, and denote by c_n(x) the characteristic function of E_n. We shall
say that the sets E_n, associated with the heights l_n, form a skew covering of 𝐄
if, for each x, there exists in the y-space a set of intervals J_n (depending on x)
of respective lengths l_n c_n(x), which cover 𝐄(x). We define m × J_1*(𝐄) as
the lower bound of Σ l_n·m(E_n) for all such finite skew coverings. It is at
once clear that

     m × J_1*(𝐄) ≤ m × J_1(𝐄)

and that

     m × J_1*(𝐄 + 𝐄') ≤ m × J_1*(𝐄) + m × J_1*(𝐄').¹⁸


THEOREM 4. If 𝐄 is the ordinate-set of f on E, then
m × J_1*(𝐄) = m × J_1(𝐄).

Suppose given any skew covering of 𝐄. By an arbitrarily small increase
in the heights l_n we may suppose them all rational: let l be the largest number
such that the numbers l_n/l are all integral. We may replace each set E_n,
associated with l_n, by the set E_n repeated l_n/l times, associated each time with
the height l. That is, it is sufficient to consider finite skew coverings
E_1, E_2, ..., E_N in which all the heights are the same, say l. We then have,
clearly, l Σ c_n(x) ≥ f(x) for all x of E. Now if E_1 ⊃ E_2 ⊃ ... ⊃ E_N, it is
easily seen that the sets E_1 × <0, l>, ..., E_n × <(n − 1)l, nl>, ...
form a finite covering of the ordinate-set of f(x); that is, m × J_1(𝐄)
≤ l Σ m(E_n).

Suppose, however, that E_1 does not include E_2. Write E'_1 = E_1 + E_2,
E'_2 = E_1E_2, with characteristic functions c'_1 and c'_2 respectively. Then
c_1(x) + c_2(x) = c'_1(x) + c'_2(x) for all x, so that we may replace E_1, E_2 by
E'_1, E'_2 and still have a skew covering of 𝐄 with heights l. We now have
E'_1 ⊃ E'_2, and mE'_1 + mE'_2 ≤ mE_1 + mE_2. If E'_1 does not include E_3 we can
similarly replace E'_1, E_3 by E'_1 + E_3 and E'_1E_3 respectively: continuing this
process, first with all pairs of integers (1, j), then with all pairs (2, j) such
that j > 2, and so on, we finally obtain a skew covering by sets E''_1, E''_2, ...
(say), with heights l, such that E''_1 ⊃ E''_2 ⊃ ... and Σ mE''_n ≤ Σ mE_n. Thus
we have again

     m × J_1(𝐄) ≤ l Σ mE_n.

We deduce at once that m × J_1(𝐄) ≤ m × J_1*(𝐄) and therefore
m × J_1*(𝐄) = m × J_1(𝐄).

¹⁸ We do not examine the question whether m × J_1* satisfies (b).
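The rearrangement used in this proof, replacing a pair of sets by their sum and their common part, first for all pairs (1, j) and then for all pairs (2, j), can be written out for finite sets. The sketch below is only an illustration of that step; the function name `nest_covering` and the sample sets are ours.

```python
def nest_covering(sets):
    """Replace the pair (E_i, E_j) by (E_i + E_j, E_i E_j) for all i < j, in the order of the proof."""
    sets = [frozenset(s) for s in sets]
    for i in range(len(sets)):
        for j in range(i + 1, len(sets)):
            a, b = sets[i], sets[j]
            sets[i], sets[j] = a | b, a & b   # leaves c_i(x) + c_j(x) unchanged at every x
    return sets

original = [{1, 2}, {2, 3}, {3, 4}]
nested = nest_covering(original)
print(nested)

# Each point lies in as many sets after the rearrangement as before.
for x in (1, 2, 3, 4):
    assert sum(x in s for s in original) == sum(x in s for s in nested)
```

After the passes the k-th set consists of the points lying in at least k of the original sets, so the sets decrease; by condition (b) each replacement leaves Σ m(E_n) no larger than before.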


Now consider two functions f(z) and g(x). Let F and G be the ordinate- 
sets of f and g on F, and G, the set ySf(x) + g(x), re H. Since, 
for any 2, the sets G(x) and G,(x) are congruent (except for a single point), 
any skew covering of G also provides a skew covering of G,, so that 


m X J,*(G) =m X J1*(G,).. 


We then have 
f (f + =m XJy*(F + G,) Sm Xdi*(F) + m 
E 


m X J,*(F) mx J,*(G) f dp* +f g dp*. 
E E 


Suppose now that f is measurable (m) relative to , so that F is measurable 
(m X J,) relative to HX Jo. Then 
(f+ 9)dp* =m =m XJd,(F) + m 
JHE 


(since F is measurable) 


+m XJ,*(G,) = f +f g dp* 
E 


as before. Theorem 5 SF follows at once. 


7. From now on we suppose that the upper measure m is regular. In this
case the Fan integrals can be expressed in terms of integrals of the Lebesgue
type, and some of the theorems of SF can thus be more easily proved. We also
obtain some new theorems.


THEOREM 5. If the upper measure m is regular, then the upper measure 
m is regular, and the sets measurable m coincide with the sets measurable 




Let E be any set in the (z,y) space, and (e:) a sequence of positive 
numbers tending to zero. For each 7 there exists a sequence of sets Ey’ K Jn‘, 
covering E, such that J,‘ is measurable (1) and 


D -| Int | < m(E) +e. 


Let H,* be an equimeasurable cover of H,‘, for m upper measure. Then 
— X is measurable and satisfies 
n 


m(Hi) <3 mH! | <m(E) + «. 


Then H = J] H; is measurable (X,) and therefore certainly measurable m; 
it covers E and satisfies mH <= mE. 

If E is measurable m, we can find similarly a set measurable (XP:) 
covering Hy) X J, — E; that is, we can find a set K, measurable (X#,) and 
contained in E, such that mK =mE—mH. E—K is then a subset of 
H—K, which is measurable (Xf:) and of zero measure; hence E is 


measurable (XP 1). 


Now suppose that E is the ordinate-set of a function f(x) on a set E. 
We can then, as in Theorem 2, take for the sets FE,‘ & Jn‘ the sets E(f > yn‘) 
X <yn', Ynsi*>, together with (y=0), where < yt: < yy; 
= WM is a suitable division of Jo. Since we may suppose 
that H,,,‘C H,* C H, for all i and n, where H is a fixed equimeasurable 
cover of H. Then H# is the ordinate-set of a function F(x) defined on H 
and measurable m, and H is the ordinate-set of the function F(x) = bound 
F(x), which is again measurable m and defined on H. Since H covers E and 
m(H) = m(E), we have F(x) >f(x) for x in F, and 


(10) P@an— — f f(x) dp*, 


where the first integral is of Lebesgue type, constructed by the use only of 
measurable sets. 

The function F(x) is not uniquely determined, as there is a certain 
arbitrariness in the choice of €;, yn‘ and H,‘. However, as we shall now show, 
any two determinations of /(z) differ only on a set of zero m-measure. 


THEOREM 6. (i) If G(x) is measurable m and G(x) =f(x) on E, 
then G(x) = F(x) almost everywhere (m) on H.® 


¹⁹ That is, except for a set of zero m-measure. We note that a function measurable
m and defined everywhere on E must be defined almost everywhere (m) on the
equimeasurable cover, H, of E.


(ii) For any e>0, m[E(f > F—-«)] = mb. 


(iii) If Fy (x) is measurable m, F,(x) = f(x) on E, and, for any « > 0, 
m[E(f > | = mE, then (x) almost everywhere (m) on 


(iv) For any y, m[H(F > y))=—m[E(F > =m[E(f > y)]. 


Proof. (i) Suppose that, if possible, m[H(G< F)]>0. For sufficiently 
small « > 0, we have m[H(G < F-—e)] >0. The function F(z) 
=min|[ F(x), | is measurable m and F,(2) = f(x) on The ordinate- 
set of F';(x) on H includes the ordinate-set of f(z) on FE and so 


(11) din— F, = Jeane. 


On the other hand, F,(z) = F(z) everywhere and m[H(F, < F—e)] 
positive, say equal to 7. Hence 


(12) f (x)dmS f, F(x)dm — en. 
H H 


Combining (10), (11) and (12) we have a contradiction. 

As a corollary we see that if F,(2) is any other determination of F(z), 
we have, almost everywhere (m) on H, F,(2) = F(x) and F(r) = F,(@); 
that is, = F(z). 

We may say that F(x) is effectively the smallest function, measurable m,
which is, everywhere on E, greater than or equal to f(x). Accordingly we
shall call it the approximate maximal function of f(x) on E, and denote it by
A*(f, E, m; x), or simply A*(f) or A*(x) when there is no risk of confusion.²⁰


(ii). Let #, denote E(f > F —e), and let H, be an equimeasurable cover 
of E.; we may suppose H,CH. If mE. < mE, then m(H —H,) > 0. 
Writing G(x) =F (x) on H, and G(r) = F(x) —e on H — H,, we obtain 
a contradiction with (i). 


(iii). By (i), m[H(F, < F)] =0. For any > 0, we have 
m[H(F > F,—e)]= bid = mE, and so, since F and F, are 
measurable, m[H(F F, — = As is arbitrary, this gives 
in[H(F < F,)]=0. 


(iv). If, for any yo, mLH(F > y)] > m[E(f > yo) ], then there exists 
ane > 0 such that > y+ «)] —m[E(f > yo) ] =a > 0, say, since 


²⁰ Cf. A. J. Ward, "On the differential structure of real functions," Proceedings
of the London Mathematical Society, (2), vol. 39 (1935), pp. 339-362 (especially 341),
where a rather different treatment is given in the special case of Lebesgue measure.


m[H(F > y)| =lim m[H(F > y+ 6)]. Accordingly m[H(F > y)] 
€>+0 
—m[E(f >y)|] 2a for ySySy+«. It follows that 


»M 
f, > lay — > lay 


which contradicts (10). 

It is clear that we could have defined the function F(x) = A*(f, E, m; x)
without any reference to 𝐦 measure.²¹ It is not difficult to prove Theorem 6
directly from such a definition without using our previous work. The
equalities (10) then follow at once from Theorem 6, (iv). The remaining
theorems of this paper might therefore have been proved independently of 1-6.

We can construct in a corresponding way the approximate minimal 


function A-(f, such that f A.dm. If f(x) is integrable 
JE He 


on we then have 


7. 


(13) f A:dm == f f dp f f 
a E E JH 


Now $A^* \geq f \geq A_*$ on $E$, so that $m[H(A^* \geq A_*)] \geq mE = mH$; accordingly,
since $A^*$ and $A_*$ are measurable, $m[H(A^* < A_*)] = 0$. It now follows from
(13) that $A^* = A_*$ almost everywhere on $H$, and so $f = A^*$ almost everywhere
on $E$. We thus find again the theorem²³ that if $f$ is relatively measurable
on $E$, then there exists a measurable function which coincides with $f$ on $E$.
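
As an illustration only, and not as part of the original argument, the two functions can be computed explicitly in a finite toy model. We assume (this is our assumption, not the paper's setting) that the measurable sets are the unions of the blocks of a partition of a finite set, so that the measurable functions are exactly those constant on each block. The names `atoms`, `weight`, `upper_function`, `lower_function` and `integral` are hypothetical. In this model the least measurable majorant of f on E takes, on each block meeting E, the largest value of f on that part of E, and the greatest measurable minorant takes the smallest.

```python
from fractions import Fraction

# Toy model (an assumption, not from the paper): X = {0,...,5}; the measurable
# sets are the unions of these atoms, and m(atom) is given by `weight`.
atoms = [frozenset({0, 1}), frozenset({2, 3}), frozenset({4, 5})]
weight = {atoms[0]: Fraction(1), atoms[1]: Fraction(2), atoms[2]: Fraction(3)}

def cover(E):
    """Atoms meeting E; their union plays the role of the cover H of E."""
    return [a for a in atoms if a & E]

def upper_function(f, E):
    """Least atom-wise constant majorant of f on E (toy analogue of A^*)."""
    return {a: max(f[x] for x in a & E) for a in cover(E)}

def lower_function(f, E):
    """Greatest atom-wise constant minorant of f on E (toy analogue of A_*)."""
    return {a: min(f[x] for x in a & E) for a in cover(E)}

def integral(F):
    """Integral over H of a function given by its constant value on each atom."""
    return sum(weight[a] * v for a, v in F.items())

E = frozenset({0, 2, 3})                    # not a union of atoms
f = {0: 5, 1: 9, 2: 1, 3: 4, 4: 7, 5: 7}    # varies on E inside an atom
g = {0: 5, 1: 9, 2: 4, 3: 4, 4: 7, 5: 7}    # constant on E inside each atom

upper_f = integral(upper_function(f, E))    # 1*5 + 2*4
lower_f = integral(lower_function(f, E))    # 1*5 + 2*1
upper_g = integral(upper_function(g, E))    # 1*5 + 2*4
lower_g = integral(lower_function(g, E))    # 1*5 + 2*4
```

Here the two integrals of `f` differ, while those of `g` agree, and `g` coincides on E with a function constant on each atom; only the values taken on E matter, so `g[1]` plays no part.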


8. We can now give very simple proofs of Theorems 3 SF, 6 SF and
18 SF, using the standard properties of integrals of measurable functions.
Let us consider first Theorem 3 SF. For each $n$, denote by $H_n$ an
equimeasurable cover of $E_n$; then $H = \sum H_n$ is an equimeasurable cover of $E = \sum E_n$.
Let $F(x) = A_*(f, E; x)$, defined on $H$, and $F_n(x) = A_*(f, E_n; x)$, defined on
$H_n$. Since $F(x)$ is measurable and $F(x) \leq f(x)$ on $E_n$, we have $F(x)
\leq F_n(x)$ almost everywhere on $H_n$, as in Theorem 6 (i). Since $F(x)$ is
non-negative and the sets $H_n$ are measurable, but may overlap, we have


21 From now on we use only m-measure: the words ‘measurable’ and ‘almost
everywhere’ always refer to this measure.
22 If K is an equimeasurable kernel of H (that is, KC HE and mK = mK = mB), 


it follows from 1 that A*(f,K)dm and f fdp* = 
E K E 


23 J. C. Burkill and U. S. Haslam-Jones, “Relative measurability and the derivates
of non-measurable functions,” Quarterly Journal of Mathematics (Oxford), vol. 4
(1933), pp. 233-239, Theorem 7.


THE FAN INTEGRALS INTERPRETED AS MEASURES IN A PRODUCT-SPACE. 159 


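$$\int_H F(x)\,dm \leq \sum_n \int_{H_n} F(x)\,dm \leq \sum_n \int_{H_n} F_n(x)\,dm,$$

that is,

$$\int_E f(x)\,d\mu_* \leq \sum_n \int_{E_n} f(x)\,d\mu_*.$$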


This applies whether the number of sets $E_n$ is finite or enumerable.
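
The inequality over a union of sets can be checked numerically in a finite toy model. This is a sketch under our own assumptions, not the paper's construction: the measurable sets are taken to be unions of two atoms, and `lower_integral` (a hypothetical name) integrates over the cover of E the greatest atom-wise constant function lying below f on E.

```python
# Toy model (an assumption): measurable sets are unions of the two atoms below.
atoms = [frozenset({0, 1}), frozenset({2, 3})]
weight = [1, 2]

def lower_integral(f, E):
    """Sum over atoms meeting E of m(atom) times the least value of f on E in it."""
    return sum(w * min(f[x] for x in a & E)
               for a, w in zip(atoms, weight) if a & E)

f = {0: 1, 1: 3, 2: 2, 3: 0}                       # non-negative function
parts = [frozenset({0, 2}), frozenset({1, 2, 3})]  # E_1, E_2 (they overlap)
E = frozenset().union(*parts)                      # E = E_1 + E_2

whole = lower_integral(f, E)                       # 1*1 + 2*0
pieces = [lower_integral(f, En) for En in parts]   # [1*1 + 2*2, 1*3 + 2*0]
```

The value over E does not exceed the sum of the values over the pieces, even though the covers of the pieces overlap.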

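To prove Theorem 6 SF, let $F(x) = A_*(f, E; x)$ and $G(x) = A_*(g, E; x)$.
Then $F(x) + G(x)$ is measurable and $F(x) + G(x) \leq f(x) + g(x)$ on $E$;
hence $F(x) + G(x) \leq A_*(f + g, E; x)$ almost everywhere on $H$. Accordingly
we have

$$\int_E f\,d\mu_* + \int_E g\,d\mu_* = \int_H F\,dm + \int_H G\,dm = \int_H (F + G)\,dm \leq \int_H A_*(f + g)\,dm = \int_E (f + g)\,d\mu_*.$$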
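
A small numerical check of this inequality, again in a finite toy model that we assume for illustration only (measurable sets are unions of two atoms; the helper names are hypothetical). The same sketch also evaluates the mirror inequality for the least majorant, with the inequality reversed.

```python
# Toy model (an assumption): measurable sets are unions of the two atoms below.
atoms = [frozenset({0, 1}), frozenset({2, 3})]
weight = [1, 2]

def lower_integral(f, E):
    """Integral over the cover of E of the greatest atom-wise minorant of f on E."""
    return sum(w * min(f[x] for x in a & E)
               for a, w in zip(atoms, weight) if a & E)

def upper_integral(f, E):
    """Integral over the cover of E of the least atom-wise majorant of f on E."""
    return sum(w * max(f[x] for x in a & E)
               for a, w in zip(atoms, weight) if a & E)

E = frozenset({0, 1, 2})
f = {0: 1, 1: 3, 2: 2, 3: 0}
g = {0: 4, 1: 0, 2: 1, 3: 5}
h = {x: f[x] + g[x] for x in f}      # h = f + g

lower_sum = lower_integral(f, E) + lower_integral(g, E)   # 5 + 2
lower_of_sum = lower_integral(h, E)                       # 1*3 + 2*3
upper_sum = upper_integral(f, E) + upper_integral(g, E)   # 7 + 6
upper_of_sum = upper_integral(h, E)                       # 1*5 + 2*3
```

In this example both inequalities are strict, because f and g take their extreme values at different points of E within the first atom.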


To prove Theorem 18 SF we have, similarly, $A_*(f_n, E; x) \leq f_n(x)$ on $E$,
and so $\lim_{n \to \infty} A_*(f_n, E; x) \leq \lim_{n \to \infty} f_n(x)$. The function on the left is measurable
and so $\lim_{n \to \infty} A_*(f_n, E; x) \leq A_*(\lim_{n \to \infty} f_n, E; x)$, almost everywhere on $H$. The
result now follows by integration.
9. We now state two new theorems on the Fan integral. 


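THEOREM 7. Given any set $E$ (on which $f(x) \geq 0$) and any $\epsilon > 0$, there
exists a set $S = S(\epsilon) \subset E$ such that

$$\int_S f\,d\mu_* \geq \int_E f\,d\mu^* - \epsilon \quad \text{and} \quad \int_S f\,d\mu^* = \int_E f\,d\mu^*.$$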


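Let $F(x) = A^*(f, E; x)$ and let $S_\epsilon$ be the set on which $f(x) \geq F(x)
- [\epsilon/mE]$. Since $mS_\epsilon = mE = mH$ we have, by a remark in 1,

$$\int_{S_\epsilon} f\,d\mu_* \geq \int_{S_\epsilon} [F(x) - \epsilon/mE]\,d\mu_* = \int_H [F(x) - \epsilon/mE]\,dm = \int_E f\,d\mu^* - \epsilon.$$

For any $\eta < \epsilon$, $S_\epsilon \supset S_\eta$. Accordingly

$$\int_{S_\epsilon} f\,d\mu^* \geq \int_{S_\eta} f\,d\mu^* \geq \int_E f\,d\mu^* - \eta,$$

as above. Since $\eta$ is arbitrary, $\int_{S_\epsilon} f\,d\mu^* = \int_E f\,d\mu^*$.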




THEOREM 8.²⁴ A necessary and sufficient condition that
$\int_E (f + g)\,d\mu^* = \int_E f\,d\mu^* + \int_E g\,d\mu^*$ (both functions being non-negative), is that, given
any $\epsilon > 0$, there exists a set $S = S_\epsilon \subset E$ such that $mS = mE$,
$\int_S f\,d\mu_* > \int_E f\,d\mu^* - \epsilon$, and $\int_S g\,d\mu^* = \int_E g\,d\mu^*$.
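(i) Sufficiency. If, for any $\epsilon > 0$, such a set $S_\epsilon$ exists, we have

$$\int_E (f + g)\,d\mu^* \geq \int_{S_\epsilon} (f + g)\,d\mu^* \geq \int_{S_\epsilon} f\,d\mu_* + \int_{S_\epsilon} g\,d\mu^* > \int_E f\,d\mu^* + \int_E g\,d\mu^* - \epsilon,$$

and since $\epsilon$ is arbitrary the result follows at once.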


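(ii) Necessity. We have, almost everywhere on $H$, $A^*(f + g) \leq A^*(f)
+ A^*(g)$ (the corresponding result for $A_*$ has been proved in 8). If

$$\int_E (f + g)\,d\mu^* = \int_E f\,d\mu^* + \int_E g\,d\mu^*,$$

then

$$\int_H A^*(f + g)\,dm = \int_H [A^*(f) + A^*(g)]\,dm,$$

so that $A^*(f + g) = A^*(f) + A^*(g)$ almost everywhere on $H$. Let $S = S_\epsilon$
be the set on which $f + g > A^*(f + g) - (\epsilon/mE)$ and also $A^*(f + g)
= A^*(f) + A^*(g)$; then $mS_\epsilon = mE$. Since $A^*(f) \geq f$ and $A^*(g) \geq g$ on $E$,
we must have, on $S$, $f > A^*(f) - (\epsilon/mE)$ and $g > A^*(g) - (\epsilon/mE)$. Just
as in Theorem 7, we deduce that

$$\int_S f\,d\mu_* \geq \int_E f\,d\mu^* - \epsilon, \qquad \int_S f\,d\mu^* = \int_E f\,d\mu^*,$$

with similar results for $g$ at the same time.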
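
The criterion can be tried out by brute force in a finite toy model. This is a sketch under assumptions of our own, not the paper's setting: the measurable sets are unions of two atoms, the upper and lower integrals are computed from atom-wise maxima and minima, and `find_witness` (a hypothetical helper) searches all subsets S of E for one satisfying the three conditions of the theorem. Since maxima are always attained in a finite model, it cannot exhibit the phenomenon that makes the ε indispensable in general.

```python
from fractions import Fraction
from itertools import combinations

# Toy model (an assumption): measurable sets are unions of the two atoms below.
atoms = [frozenset({0, 1}), frozenset({2, 3})]
weight = [1, 2]

def outer_measure(S):
    """Total weight of the atoms meeting S."""
    return sum(w for a, w in zip(atoms, weight) if a & S)

def upper_integral(f, S):
    return sum(w * max(f[x] for x in a & S)
               for a, w in zip(atoms, weight) if a & S)

def lower_integral(f, S):
    return sum(w * min(f[x] for x in a & S)
               for a, w in zip(atoms, weight) if a & S)

def is_additive(f, g, E):
    """Does the upper integral of f + g equal the sum of the upper integrals?"""
    h = {x: f[x] + g[x] for x in f}
    return upper_integral(h, E) == upper_integral(f, E) + upper_integral(g, E)

def find_witness(f, g, E, eps):
    """First subset S of E with mS = mE, lower(f, S) > upper(f, E) - eps
    and upper(g, S) = upper(g, E); None if no subset qualifies."""
    points = sorted(E)
    for size in range(1, len(points) + 1):
        for chosen in combinations(points, size):
            S = frozenset(chosen)
            if (outer_measure(S) == outer_measure(E)
                    and lower_integral(f, S) > upper_integral(f, E) - eps
                    and upper_integral(g, S) == upper_integral(g, E)):
                return S
    return None

E = frozenset({0, 1, 2, 3})
f = {0: 1, 1: 3, 2: 2, 3: 2}
g_good = {0: 0, 1: 4, 2: 1, 3: 5}   # largest where f is largest, in each atom
g_bad = {0: 4, 1: 0, 2: 1, 3: 5}    # largest where f is smallest, in atom {0, 1}
eps = Fraction(1, 2)

witness_good = find_witness(f, g_good, E, eps)
witness_bad = find_witness(f, g_bad, E, eps)
```

For `g_good` the upper integrals add and a set S is found; for `g_bad` they do not add and, for this ε, no subset of E satisfies all three conditions.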


VICTORIA UNIVERSITY,
MANCHESTER, ENGLAND. 


24 It is easily shown by an example that the $\epsilon$ cannot, in general, be omitted; it is
not always possible to find $S \subset E$, with $mS = mE$, on which $f$ is integrable ($\mu$) and
$\int_S f\,d\mu = \int_E f\,d\mu^*$. A similar remark applies to Theorem 7.


